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(57) Abstract 

Transformed plant cells which have increased starch content are disclosed. Also disclosed are whole plants comprising 
plant cells which express CTP/ADP glucose pyrophosphorylase genes. 
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INCREASED STARCH CONTENT IN PLANTS 

This is a continutation-in-part of a co-pending U.S. 
application having serial No. 07/539,763, filed on June 18, 1990 
and entitled "Increased Starch Content in Plants". 

Recent advances in genetic engineering have provided 
the requisite tools to transform plants to contain foreign genes. 
It is now possible to produce plants which have unique 
characteristics of agronomic and crop processing importance. 
Certainly, one such advantageous trait is enhanced starch 
content and quality in various crop plants. 

Starch is a polysaccharide primarily composed of 
glucose units connected by alpha 1-4 and alpha 1-6 linkages. It 
-jrj is found in plant cells as water-insoluble grains or granules. 
During photosynthesis, starch is produced and stored in 
chloroplasts. Starch is also synthesized in roots and storage 
organs such as tubers and seeds. In these non-photosynthetic 
tissues, the starch is found in a form of plastids called 
2^ amyloplasts. As in the chloroplasts, starch is stored in the 
amyloplasts as starch granules. The size of the granules varies 
depending on the plant species. 

Starch is actually composed of amylose and 
amylopectin, two distinct types of glucose polymers. Amylose is 
25 composed primarily of linear chains of alpha 1-4 linked glucose 
molecules. On average, amylose has a chain length of about 
j 1000 glucose molecules. Amylopectin contains shorter chains 

J linked together with alpha 1-6 linkages. On average, 

j amylopectin has a chain length of about 20-25 glucose molecules, 

j Until recently, there was controversy in the literature 

j as to whether ADPglucose or UDPglucose was the substrate for 

j starch synthesis. With -the isolation of Arabidopsis mutants 

lacking ADPglucose pyrophosphorylase it is now accepted that 
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plan ts use ADPglucose as the substrate for starch s^thesi. 
There are three steps in the synthesis of starch All thes 

♦ t*ke place within the chloroplasts or amyloplasU. In 
reactions duced from gluc0 se-l-phosphata 

the first step, ADPglucose is P 

and ATP by ADPglucose pyrophosphorylase to 
second sup, ADPgiucose is used by starch synthase <E ^ • 
u form linear chains of starch containing the a. M hnkage^ 
^ third SUP. the branching enzyme.s, (EC 2.4 OS, urtroduce 
7. . . , J a „ s to produce the amylopectin molecule. 

controUmg step m starch b jm ^ bUcation num ber 

„ In fact. European PaUnt ^^*p^ pyrophosphorylase, 
AfjcosnR A2 which concerns ADPglucose pj* f 
0368506 A2 whi ^ atep m 

20 ^ . t TL46 was found to contain only about 5% of the 

This mutant, TL4b, was iuuh 

type plants. However, M P hosphory lase is 

the wild type starch levels. If Aur giuco* fj 

the wild iyp ve cted a 95% 

« the rate limiting enzyme, one woui 

25 o^ivitv to produce more than a w« 

reduction in enzyme activity pr 

» .. in _ tarC h accumulation. Similarly, tne m 
reduction in starcn *^ enzyme can 

^ „„ pxtractable activities suggest this enzym 

rrrruXg"- * - — - * 

„ Ubited by the aUosUhc regulators of the enzyme acfvuy. 
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ftTfMMARY OF THE INVENTION 

The present invention provides structural DNA 
constructs which encode an APPglucose pyrophosphorylase 
(ADPGPP) enzyme and which are useful in producing enhanced 
starch content in plants. It is also demonstrated that the 
ADPGPP enzyme activity in plant cells and tissues is a 
controlling step in starch biosynthesis. 

In accomplishing the foregoing, there is provided, in 
accordance with one aspect of the present invention, a method of 
producing genetically transformed plants which have elevated 
starch content, comprising the steps of: 

(a) inserting into the genome of a plant cell a 
recombinant, double-stranded DNA molecule 
comprising 

(i) a promoter which functions in plants to 
cause the production of an RNA sequence 
in target plant tissues* 

(ii) a structural DNA sequence that causes the 
production of an RNA sequence which 
encodes a fusion polypeptide comprising an 
amino- terminal plastid transit peptide and 
an ADPglucose pyrophosphorylase 
enzyme, 

(iii) a 3* non-translated DNA sequence which 
functions in plant cells to cause 
transcriptional termination and the 
addition of polyadenylated nucleotides to 
the 3' end of the RNA sequence; 

(b) obtaining transformed plant cells; and 
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(c) regenerating from the transformed plant cells 
genetically transformed plants which have an 
elevated starch content. 
In accordance with another aspect of the present 
invention, there is provided a recombinant, double-stranded 
DNA molecule comprising in sequence: 

(a) a promoter which functions in plants to cause the 
production of an RNA sequence in target plant 
tissues; 

(b) a structural DNA sequence that causes the 
production of an RNA sequence which encodes a 
fusion polypeptide comprising an amino- 
terminal plastid transit peptide and an 
ADPglucose pyrophosphorylase enzyme; and 

15 (c) a 3" non-translated region which functions m 

plant cells to cause transcriptional termination 
and the addition of polyadenylated nucleo-tidea to 
the 3* end of the RNA sequence, said promoter 
being heterologous with respect to the structural 
20 DNA. 

There ha* also been provided, in accordance with 
another aspect of the preeent invention, bacterial and 
transformed plant cells that contain, respect,™ , 
comprised of (he above-mentioned elements (a), (b) and (c). 
25 In accordance with yet another aspect of the present 

invention, differentiated plants are provided that have increased 
starch content. 



30 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows the nucleotide sequence (SEQ ID NO:l) 
and deduced amino acid sequence (SEQ ID NO:2) for the 
ADPglucose pyrophosphorylase (glgC) gene from E. colL 

Figure 2 shows the nucleotide sequence (SEQ ED NO:3) 
and deduced amino acid sequence (SEQ ID NO:4) for the mutant 
ADPglucose pyrophosphorylase (glgCl6) gene from£. colL 

Figure 3 shows the nucleotide sequence (SEQ ID NO:5) 
and corresponding amino acid sequence (SEQ ID NO:6) for the 
modified chloroplast transit peptide from the ssRUBISCO 1A 
gene from Arabidopsis thaliana. 

Figure 4 shows a plasmid map for plant transformation 
vector pMON530. 

Figure 5 shows the nucleotide sequence (SEQ ED NO:7) 
and the corresponding amino acid sequence (SEQ ID NO:8) of the 
assembled small subunit ADPglucose pyrophosphorylase gene of 
potato. 

Figure 6 shows the near full length nucleotide sequence 
(SEQ ID NO:9) and the corresponding amino acid sequence (SEQ 
ID NO: 10) of the almost complete large subunit ADPglucose 
pyrophosphorylase gene of potato. 

Figure 7 shows a plasmid map for plant transformation 
vector pMON20113. 

Figure 8 shows a plasmid map for plant transformation 
vector pMON16938. 

Figure 9 shows a plasmid map for plant transformation 
vector pMON977. 

Figure 10 shows a plasmid map for plant 
transformation vector pMON 16950. 
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Figure U shows a plasmid map for plant 
transformation vector P MON10098. 

Tfce expression of a plant gene which exists in double- 
t , ed DNA form involves transcription of messenger RNA 
stranded DNA tonn polymerase 

(mRNA) from one mRNA primary 

enzyme, and pissing involves a 3' non- 

3 ' end ° f ;lTcription of DNA into mRNA i. regulated by a 

region of DNA usually referred to as the ™* 
region o . se quence of bases that Bignals RNA 

polymerase to associav* str ands as a 

. ^ m DNA using one of the uwa 5irmJU 

*> ^ A number of promoters which are active in plant cells 
a • w in the literature. These include the 
have been described in tne uw oromo ters 
. , Mn m w octopine synthase (OCS) promote™ 

(which are c^oviru. promoter. such 

soccer.™ 19S ^ 35S and the 

" 35S™ - Hght-indncibie 

Ggwort mosa. . 3 s 5 ub ^ t of ^^,5-bU-phosphaU 

promoter from Ae smaU . peptide), 
carboxylase (ssRUBlSCO, a very a r 
and the chiorophyll a/b binding proteu. gene promoter e«. 
30 of these promo** have been used to create venous type. D 
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constructs which have been expressed in plants; see, e.g., PCT 
publication WO 84/02913 (Rogers et al., Monsanto). 

Promoters which are known or are found to cause 
transcription of RNA in plant cells can be used in the present 
invention. Such promoters may be obtained from a variety of 
sources such as plants and plant viruses and include, but are 
not limited to, the enhanced CaMV35S promoter and promoters 
isolated from plant genes such as ssRUBISCO genes. As 
described below, it is preferred that the particular promoter 
selected should be capable of causing sufficient expression to 
result in the production of an effective amount of ADPglucbse 
pyrophosphorylase enzyme to cause the desired increase in 
starch content. In addition, it is preferred to bring about 
jg expression of the ADPGPP gene in specific tissues of the plant 
such as leaf, root, tuber, seed, fruit, etc. and the promoter 
chosen should have the desired tissue and developmental 
specificity. Those skilled in the art will recognize that the 
amount of ADPglucose pyrophosphorylase needed to induce the 
desired increase in starch content may vary with the type of 
plant and furthermore that too much ADPglucose 
pyrophosphorylase activity may be deleterious to the plant. 
Therefore, promoter function should be optimized by selecting a 
promoter with the desired tissue expression capabilities and 
2g approximate promoter strength and selecting a transformant 
which produces the desired ADPglucose pyrophosphorylase 
activity in the target tissues. This selection approach from the 
pool of transformants is routinely employed in expression of 
heterologous structural genes in plants since there is variation 
^ between transformants containing the same heterologous gene 
due to the site of gene insertion within the plant genome. 
(Commonly referred to as "position effect"). 
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It is preferred that the promoters utilized in the double- 
stranded DMA molecules of the present invention have relatively 
high expression in tissue, where the increased starch content ,s 
desired, such as the tuber of the potato plut and the fruit of 
tomato. In potato, a particularly preferred promoter m Una 
regard is the patatin promoter described herein in greate. ■ detaj 
in the accompanying example,. Expression of the double- 
stranded DNA molecules of the present ir.vent.on by a 
constitutive promoter, expressing the DNA molecule ,n all or 
10 most of the tissues of the plant, will be rarely preferred and may. 
in some instances, be detrimental to plant growth 

The class I patatin promoter, used in thrs study to 
express the E. coli ADPGPP. has been shown to be both lugUy 
active and tuber-spedf-c (Sevan et al., 1986; Jefferson et al 
15 l990 ). A number of other gene, with tuber-sp^dfic or enhanced 
expression are known, including the potato tuber ADPGPP 
genes (Muller et a!.. 1990). sucrose synthase (Salanouba^ ^and 
Belliard. 1987, 1989). the maior tuber proteins tnctafang the 22 kd 
protein complexes and proteinase inhibitor, (Hannape 1 1990 . 
20 and the other class I and H paUtins (Bocha-Sosa et al.. 1989, 

Migneryetal.,1988). Anp.lucose 
In addition to the endogenous plant ADPglucose 
pyrophosphatase promoter,, otter promoter, can also be used 
^express an ADPglucose pyrophosphorylase gene ,n specie 
* tissues, such as leaves, seeds or fruits. 

known as the 73 protein) is one of the maior storage protems 
soybean (Glycine max) (Tierney. 1987). The promoter for B- 
Iglycinin could be used to over-expres, the E. col,, or any 
other, ADPglucose pyrophosphorylase gene. 
30 seeds, which would lead to an increase is the starch content of 
the seeds. The fl-subunit of B-conglycinin has been expressed. 



WO?1/19806 



PCT/US9 1/04036 



q 



10 



15 



20 



25 



30 



using its endogenous promoter, in the seeds of transgenic 
petunia and tobacco, showing that the promoter functions in a 
seed-specific manner in other plants (Bray, 1987). 

The zeins are a group of storage proteins found in 
maize endosperm. Genomic clones for zein genes have been 
isolated (Pedersen, 1982), and the promoters from these clones 
could also be used to express an ADPglucose pyrophosphorylase 
gene in the seeds of maize and other plants. 

The starch content of tomato fruit can be increased by 
expressing an ADPglucose pyrophosphorylase gene behind a 
fruit specific promoter. The promoter from the 2A11 genomic 
clone (Pear, 1989) or the E8 promoter (Deikman, 1988) would 
express the ADPglucose pyrophosphorylase in tomato fruits. In 
addition, novel fruit specific promoters exhibiting high and 
specific expression during the development of the tomato fruit 
have been isolated. A differential screening approach utilizing a 
tomato fruit cDNA library was used to identify suitable cDNA 
clones that expressed specifically in green fruit cDNA probes 
prepared from mRNA extracted from fruit at early and late 
developing stages, from combined leaf+stem tissue, and from 
root tissue of the tomato plant were used. Clones that expressed 
abundantly in green fruit and that showed no detectable 
expression in leaves were identified. Genomic Southern 
analysis indicated a small (1-2) gene copy number. The 
promoters for these cDNA clones were then isolated by screening 
a tomato genomic clone bank. The expression pattern of these 
promoters is confirmed by fusion to the B-glucuronidase (GUS) 
gene and by following the expression of the GUS enzyme during 
development in transgenic fruit. Promoters that exhibit 
expression in most cells of the fruit are then fused to the CTP- 
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glgC l6 and other glgC alleles or the ADPGPP genes derived 
from either algae or plants. bv 
The starch content of root tissue can be xncreased by 
no ,„ ADPglucose pyrophosphorylase gene behind a 
The Promoter from the acid chitinase 
Tne Xl e t al.. 1990) would express the ADPglucose 
^pho^ase inroot tissue. °*"^ f 
be accomplished by utilizing the root speafic .ubdoman* , rf 
to CaMV35S promos that have been identified (Benfe, t al 
'0 * 89) The starch content of leaf tissue can be .ncreased by 
expressing the ADPglucose pyrophosphorylase gene (e.g. glgC 
gene) ling a leaf active promoter such as ssRUBISCO promoter 
„r chlorophyll a/b binding protein gene promoter. 

The KNA produced by a DNA construct of the present 
15 «„» also contains a 5' non-translated leader sequence. 

™s eTuen - » *"» ^ ^ 

T 1, and can be specifically modified so as to 

In^IIe — T ofThe mRNA. The S non— < 
"can also be obtained from viral RNAs. from su,tabU 
» Xryoac genes, or from a synthetic gene sequence The 
preset invention is not limited U, construct., as presen^ hn .he 
Lowing examples, wherein the non-trandated reg>on . 
"Lm J 5' non-translated sequence that accompame 
promoter sequence. Rather, the non-transUted leade 

25 ..„„ be derived from an unrelated promoter or coding 

sequence can be aenveu 

sequence as discussed above. 

The DNA constructs of the present .nvenUon also 
contain a structural coding sequence in double-stranded ^NA 
ZT which encodes a fusion polypeptide comprismg an anuno- 
» t*^- transit peptide and an ADPglucose 
p yZhosphorylase enzyme. The ADPglucose pyrophospho- 



WO 91/19806 



PCT/US9 1 / 04036 



1 i 

rylase enzyme utilized in the present invention is preferably 
subject to reduced allosteric control in plants. Such an 
unregulated ADPglucose pyrophosphorylase enzyme may be 
selected from known enzymes which exhibit unregulated 
enzymatic activity or can be produced by mutagenesis of native 
bacterial, or algal or plant ADPglucose pyrophosphorylase 
enzymes as discussed in greater detail hereinafter. In some 
instances, the substantial differences in the nature of regulators 
modulating the activity of the wild type ADPglucose 
pyrophosphorylase (ADPGPP) enzyme permits the use of the 
wild type gene itself; in these instances, the concentration of the 
regulators within plant organelles will facilitate elicitation of 
significant ADPGPP enzyme activity. 

15 

Bacterial ADPglucose Pvrophosphorvlasea 

The E. coli ADPglucose pyrophosphorylase has been 
well characterized as a tightly regulated enzyme. The activator 
fructose 1,6-bisphosphate has been shown to activate the enzyme 
by increasing its V max , and by increasing the affinity of the 

A) 

enzyme for its substrates (Preiss, 1966 and Gentner, 1967). In 
addition, fructose 1,6-bisphosphate (FBP) also modulates the 
sensitivity of the enzyme to the inhibitors adenosine-5*- 
monophosphate (AMP) and inorganic phosphate (Pj) (Gentner, 

25 1968). 

In 1981, the E. coli K12 ADPglucose pyrophosphorylase 
gene (gig C), along with the genes for glycogen synthase and 
branching enzyme, were cloned, and the resulting plasmid was 
named pOP12 (Okita, 1981). The gig C gene, which was 
30 sequenced in 1983, contains 1293 bp (SEQ ID NO:l) and encodes 
431 amino acids (SEQ ID NO:2) with a deduced molecular weight 
of 48,762 is shown in Figure 1 (Baecker, 1983). 
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The gl 8 CIS gene was generated ^ chemically 
• •„„ E coli K12 strain PA 601 with N-methyl-N 
m utagem«ng ^. oh Ki Creuzet . S i g al, 1972). 

nitrosoguanidn* CaUan^o, 

Glycogen bl0S ^ e i 7" Uta ; he C16 mutan t was found to 

fanatics of the gig o ^ ^ 

AUi-giu v f the activator, Fructose 1,6- 

ADPglucose in the absence 

bisphosphate (FBP), and the concentration of FBP needed o 

Jl activation of the enzyme was decreased in gig CIS. 

15 * C16 by r^O > CAMP) was also reduced- ^ 
The C16 gene from cou 
„ „, U86) Two clones, with opposite orientation, were 

were transformed mto £. «" m co/i K-12 G6MD3 

«W- P^«^ e H AC70H1-O04. 

, at" p'ltave- to sevens b £~ *~ 
25 .ctivittes. Both plasmids. pEBL and I pE 

ADPglucose pyrophosphorylase actmty » both mutan St 
The cloned ADPglucose pyrophosphorylase was pa^aUy 
purified from E. co« strain AC70R! transformed w.th * pEBL3 

plasmid. This en,yme was «P— "? " **** 

30 Lrified ADPglucose pyrophosphorylase from the ongma 
Tunt strain ( E . co«* 12 6181. and to the partially punned 
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ADPglucose pyrophosphorylase from E. coli K-12 strain 356, 
which is the wild type parent strain of strain 618. The wild type 
and mutant enzymes were compared in their levels of activation 
and inhibition. The parent strain 356 ADPglucose pyrophos- 
phorylase was activated about 45-fold with fructose 1,6- 
bisphosphate. The sigmoidal activation curve had a Hill slope of 
1.7 1 and 50% maximal stimulation was seen at 62 |iM FBP. The 
mutant strain 618 ADPglucose pyrophosphorylase was more 
active in the absence of FBP, and was activated only 1.8- to 2-fold 
with FBP. The activation curve for the 618 ADPglucose 
pyrophosphorylase was hyperbolic with a Hill slope of 1.0, and 
50% of maximal stimulation was seen at 15 +/-3.1 ^M. The 
enzyme expressed from the pEBL3 plasmid gave the same FBP 
kinetic constants as the ADPglucose pyrophosphorylase from 
mutant strain 618. 

The DNA sequence of the gig CIS gene is now known 
(SEQ ID NO:3) (Kumar, 1989). Referring to Figure 2, when the 
gig C16 deduced amino acid sequence (SEQ ID NO:4) was 
compared to the nonisogenic E. coli K-12 3000,, two amino acid 
changes are noted. The two changes are Lys 296 to Glu, and Gly 
336 to Asp. 

A number of. other ADPglucose pyrophosphorylase 
mutants have been found in E. coli. The expression of any of 
these or other bacterial ADPglucose pyrophosphorylase wild type 
or mutants could also be used to increase starch production in 
plants. 

E. coli K12 strain 6047 {gig C47) accumulates about the 
same amount of glycogen during stationary phase as does strain 
618 {gig C16). Strain 6047, like 618, shows a higher apparent 
affinity for FBP, and more activity in the absence of FBP. 
However, the enzyme from strain 6047 is reportedly more 
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9eneUi »e to inhibition by AMP compared to the enzyme from 
strain 618 (Laffl-Damotto, 1977)- 

The E »K B mutant, SG5, ha. a higher afiuuty for .* 
aUosteric activators and a .ower affinity for its anos«nc 
inhibitor, when compared to iu parent .tram (Govons. 1969 
tons, \m and Preiss, X973). These change, afcne m ak e * 
euzyme more active under physiological cond.t.ons. and tbi. 

the bactoria to accumulate two to three t-- « «* 
^o en as the parent strain. The mutant ADPglucose 
10 pyUhosphoryl.se from SG5. like the wild type, «.sts as a 
h I Lrame, Unlike the wi.d type, however, J causes th 
mutant enzyme to form higher weight oligomer. (Carison_ 19m 
The ADPglucose pyrophosphorylase from the E. «X. » 
m »tant strain CL1136-504 also ha. a higher apparent rifattrfer 
15 activator, and a iower apparent affinity for u**.tor. 
»1 and Prei... 1973). This mutant win 
four-fold more glycogen than the wild type E. col, Under 
^ated conditions, the purified CUUM04 enzyme and the 
wild type (AC70K1, enzyme have comparable sp«mc active. 
20 However, in the absence of any activator., the CL1136 504 
enzyme i. highly active, unlike the wild type enzyme 

The g l g C gene from Sd-^ ™ £ 

a!,o been cloned and sequenced (Leung and Preiss 1987a). The 
a ^.^ammoacidswithaded^^w^ 

same gene from iu. coti Ud - 
,evel and 80* identity at the DMA level. lake the &«* 
ADPglucose pyrophosphoryiase. the Sa/mone, to »P»" 
LT2 ADPglucose pyrophosphorylase i. also activated by FBP and 
30 is inhibited by AMP !Leung and Preiss ISHU. Tms substantial 
Nation L amino acid sequences suggests that introduction 
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of mutations which cause enhancement of ADPGPP activity in 
E. coli into S. typhimurium ADPGPP gene should have a similar 
effect on the ADPGPP enzyme of this organism. 

A number of other bacterial ADPglucose pyrophos- 
phorylases have been characterized by their response to 
activators and inhibitors (for review see: Preiss 1973). Like the 
Escherichia coli ADPglucose pyrophosphorylase, the 
ADPglucose pyrophosphorylases from Aerobacter aerogenes, 
Aerobacter cloacae, Citrobacter freundii, and Escherichia 
aurescens are all activated by FBP and are inhibited by AMP. 
The ADPglucose pyrophosphorylase from Aeromonas formicans 
is activated by fructose 6-phosphate or FBP, and is inhibited by 
ADP. The Serratia marcescens ADPglucose pyrophosphorylase, 
however, was not activated by any metabolite tested. The 
photosynthetic Rhodospir ilium rubrum has an ADPglucose 
pyrophosphorylase that is activated by pyruvate, and none of the 
tested compounds, including Pi, AMP or ADP, inhibit the 
enzyme. Several algal ADPglucose pyrophosphorylases have 
been studied and found to. have regulation similar to that found 
for plant ADPglucose pyrophosphorylases. Obviously, the 
ADPglucose pyrophospho-rylases from many organisms could 
be used to increase starch biosynthesis and accumulation in 
plants. 

In addition to E. coli and plant ADPGPP enzymes, other 
sources, including but not limited to cyanobacteria, algae, and 
other procaryotic and eucaryotic cells can serve as sources for 
ADPGPP genes. For example, isolation of the Synechocystis and 
the Anabaena ADPGPP genes could be performed using 
oligonucleotides corresponding to the E. coli ADPGPP activator 
site, (amino acid residues 25-42 of Figure 1), which is highly 
conserved across widely divergent species. Oligonucleotides 
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corresponding to this region would facilitate gone .soiafco »>- 
used as probes of genomic libraries. Alternately, the PGR 
re lon (described in Sample » «** 
segments of an ADPGPP gene by us.ng 5 -J 
Jresponding to the E. coli activator s.te. and 3 pnmers 
corresponding to E. co/i catalytic sites, for exampU^ the E_cob 
^Pglucose binding site. Product, of the PCR reason could be 
TJ as probes of genomic libraries for isolafon of the 
corresponding full length gene. 

10 

Plant A pPplirnse Prrmh^ hftrvlase3 

" f Z ^rLe!rop7u»=--t>-ought to be the pnmary 
substrate for starch biosynthesis in plants. Howeve, 
ADPglucose was found to be a better substrate for starch 
» ^thesis than UDPglucose (Kecondo 196U «~ 
report states that ADPglucose pyrophosphorylase acfvty was 

"I'Xfuaf ABP^ose pyrophosphate^ 
partially purified and was shown to be activated by 3- 
» pChogl'cerate (3-PGA) and inhibited by inorgamc phospha* 
G 1 !t al.. WO. The report by <^ « * 
the biosynthesis of leaf starch was regulated by the leve 
ADPgW. The activator. 3-PGA. is the primary product o 
rfol nxa.on in photosynthesis. During photosynthes,, the 
* Uvels of 3-PGA would increase, causing action 

ADPglucose pyrophosphoryU*.. At the same « ^ * 
V- would decrease because of photophosphorylafcon. decreasmg 
tne inhibition of ADPglucose pyrophosphorylase. 

biosynthesis. During darkness, 3-PGA levels w 
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and Pj levels would increase, decreasing the activity of 
ADPglucose pyrophosphorylase and, therefore, decreasing 
biosynthesis of ADPG and starch (Ghosh, 1966). 

The ADPglucose pyrophosphorylase from spinach 
leaves was later purified to homogeneity and shown to contain 
subunits of 51 and 54 kDa (Morell, 1987). Based on antibodies 
raised against the two subunits, the 51 kDa protein has 
homology with both the maize endosperm and potato tuber 
ADPglucose pyrophosphorylases, but not with the spinach leaf 
54 kDa protein. 

The sequence of a rice endosperm ADPglucose 
pyrophosphorylase subunit cDNA clone has been reported 
(Anderson, 1989a). The clone encoded a protein of 483 amino 
acids. A comparison of the rice endosperm ADPglucose 
pyrophosphorylase and the E. coli ADPglucose pyrophos- 
phorylase protein sequences 6hows about 30% identity. Also in 
1989, an almost full-length cDNA clone for the wheat endosperm 
ADPglucose pyrophosphorylase was sequenced (Olive, 1989). 
The wheat endosperm ADPglucose pyrophosphorylase clone has 
about 24% identity with the E. coli ADPglucose 
pyrophosphorylase protein sequence, while the wheat and the 
rice clones have 40% identity at the protein level. 

Further evidence for the existence of deregulated wild 
type plant ADPglucose pyrophosphorylases is found in the paper 
by Olive et aL (Olive, 1989). They claim that the wheat leaf and 
endosperm ADPglucose pyrophosphorylases have very different 
allosteric regulation. The endosperm ADPglucose 
pyrophosphorylase is. not activated by 3-PGA and requires ten 
times more of the inhibitor, orthophosphate, to achieve 50% 
inhibition than the leaf enzyme. 
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The maize endosperm ADPglucose pyrophosphorylase 
has been purified and shown to have catalytic and regulatory 
p pertiel similar to those of other plant ADPglucose 
pyrophosphorylases (Plaxton, 1987). The native molecular 
wTght of the maize endosperm enzyme is 230,000, and ,t » 
composed of four subunits of similar sue. 

The native molecular weight of the potato tuber 
ADPglucose pyrophosphorylase is reported to be 200,000. wilha 
su buLt size of 50.000 (Sowokinos, 1982). ActivUy of the tube 
ADPglucose pyrophosphorylase is almost 

on 3-PGA. and as with other plant ADPgluc se 
pyrophosphorylases. is inhibited by P, The potato tuber and leaf 
ADPglucose pyrophosphorylases have been demonstrated tab. 
sinnlar in physical, catalytic, and allosteric propertxes 
(Anderson. 1989b). 



MiUa ^ se sKUed in the art will recognize that while not 
20 absolutely required, enhanced results are to be obtained by u*ng 
ADPglucose pyrophosphorylase genes which are subject t. 
redued allosteric regulation ("deregulated-) and l«r. 
prrferabl, not subject to signincant levels of auostenc regulabn 
Unregulated-) while maintaining adequate catalyse actmty 
25 The structural coding sequence for a bactenal or plant 
ADPglucose pyrophosphorylase enzyme can be mutagemxed m 
* coli or another suitable, host and screened for ,ncreas^ 
Glycogen production as described for the He «6 gene of E. «*. 
U shou.d be realized that use of a gene encoding an ADPglu-e 
30 p yro phosphory.ase enzyme which is only subject to modulate 
(^iL^nhibitors) which are present in the selected plant at 
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levels which do not significantly inhibit the catalytic activity will 
not require enzyme (gene) modification. These "unregulated" or 
"deregulated" ADPglucose pyrophosphorylase genes can then be 
inserted into plants as described herein to obtain transgenic 
plants having increased starch content. 

For example, any ADPglucose pyrophosphorylase gene 
can be cloned into the E. coli B strain AC70R1-504 (Leung, 1986). 
This strain has a defective ADPglucose pyrophosphorylase gene, 
and is derepressed five- to seven-fold for the other glycogen 
biosynthetic enzymes. The ADPglucose pyrophosphorylase gene/ 
cDNA's can be put on a plasmid behind the E. coli gig C 
promoter or any other bacterial promoter. This construct can 
then be subjected to either site-directed or random mutagenesis. 
After mutagenesis, the cells would be plated on rich medium 
with 1% glucose. After the colonies have developed, the plates 
would be flooded with iodine solution (0.2w/v% I 2 , 0.4w/v% KI in 
H2O, Creuzet-Sigal, 1972). By comparison with an identical plate 
containing non-mutated E. coli, colonies that are producing 
more glycogen can be detected by their darker staining. 

Since the mutagenesis procedure could, have created 
promoter mutations, any putative ADPglucose pyrophospho- 
rylase mutant from the first round screening will have to have 
the- ADPglucose pyrophosphorylase gene reclbned into non- 
mutated vector and the resulting plasmid will be screened in the 
same manner. The mutants that make it though both rounds of 
screening will then have their ADPglucose pyrophosphorylase 
activities assayed with and without the activators and inhibitors. 
By comparing the mutated ADPglucose pyrophosphorylase's 
responses to activators and inhibitors to the non-mutated 
enzymes, the new mutant can be characterized. 
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The report by Plaxton and Preiss in 1987 demonstrates 
that the maize endosperm ADPglucose pyrophosphorylase has 
regulatory properties similar to those of the other plant 
ADPglucose pyrophosphorylases (Plaxton and Prexss 1987). 
They show that earlier reports claiming that the 
endosperm ADPglucose pyrophosphorylase had enhanced 
activity in the absence of activator (3-PGA) and decreased 
sensitivity to the inhibitor (Pi), was due to proteolytic cleavage of 
the enzyme during the isolation procedure. By altering an 
ADPglucose pyrophosphorylase gene to produce an enzyme 
analagous to the proteolytic^ cleaved maize endosperm 
ADPglucose pyrophosphorylase. decreased allosteric regulation 

will be achieved. 

To assay a liquid culture of E. coli for ADPglucose 

pyrophosphorylase activity, the cells are spun down m a 
TntrWe and resuspended in about 2m. * «~ 
(0.05 M glycylglydne pH 7.0. 5.0 mM DTE. 1.0 mM EDTA) P« 
^ of ceU paste. The cell, are lysed b, passu* tw.ee U^gh 
Iprench Press. The cell extract, are spun m a nucrocentrfuge 
£l minutes, and the supernatant, are desalted by pas,** 
through a G-50 spin column. . 

Th. enzyme assay for the synthesis of ADPgluco* ... 
n^dificauon of a published procedure (Haugen. 1976). Each 100 
71, contain, 10 umole Hope. pH 7.7. 50 ug BSA. 0 05,xmole 
Tf t..Clglucose-1-phosphate, 0.15 umole ATP. 0.5 umole MgCl* 
0 ! ug of crystalline yeast inorganic pyrophosphatase 1 mM 

• mn lvhdate enzyme, activators or inhimtors as 
ammonium molybdate. enzym . 

desired, and water. The assay .s mcubated at 37 C 
%Z», and is stopped by boiling for 60 seconds. The assay . 
spun down in a microcentrifuge, and 40 pi of the supernatant , 
injected onto a Synchrom Synchropak AX-100 anion exchange 
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HPLC column. The sample is eluted with 65 mM KPi pH 5.5. 
Unreacted [i 4 C]glucose-l-phosphate elutes around 7-8 minutes, 
and [i*C]ADPglucose elutes at approximately 13 minutes. 
Enzyme activity is determined by the amount of radioactivity 

5 

found in the ADPglucose peak. 

The plant ADPGPP enzyme activity is tightly regulated, 
by both positive (3-phosphoglycerate; 3-PGA) and negative 
effectors (inorganic phosphate; Pi) (Ghosh and Preiss, 1966; 

jq Copeland and Preiss 1981; Sowokinos and Preiss 1982; Morell et 
al. t 1987; Plaxton and Preiss, 1987; Preiss, 1988;) and the ratio of 
3PGA:Pi plays a prominent role in regulating starch 

biosynthesis by modulating the ADPGPP activity (Santarius and 
Heber, 1965; Heldt et aL, 1977; Kaiser and Bassham, 1979). The 

15 plant ADPGPP enzymes are heterotetramers of two 
large^'shrunken" and two smallTBrittle" subunits (Morell et 
aL, 1987; Lin et aL, 1988a, 1988b; Krishnan et aL, 1986; Okita et 
aL, 1990) and there is strong evidence to suggest that the 
heterotetramer is the most active form of ADPGPP. Support for 

20 this suggestion comes from the isolation of plant "starchless" 
mutants that are deficient in either of the subunits (Tsai and 
Nelson, 1966; Dickinson and Preiss, 1969; Lin et al. t 1988a, 1988b) 
and from the characterization of an "ADPGPP" homotetramer of 
small subunits that was found to have only low enzyme activity 

25 (Lin et aL, 1988b). In addition, proposed effector interaction 
residues have been identified for both subunits (Morell et aL, 
1988). 

Unregulated enzyme variants of the plant ADPGPP are 
identified and characterized in a maimer similar to that which 
30 resulted in the isolation of the E. coli glgCl6 and related 
mutants. A number of plant ADPGPP cDNA's, or portions of 
such cDNA's, for both the large and small subunits, have been 
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cloned from both monocots and dicots (Anderson et al 1989a; 
Olive et al., 1989; Muller et al.. 1990; Bhave et al.. 1990; du Jardm 
and Berhin. 1991) The proteins encoded by the <DNAs. as 
well as those described from bacteria, show a high degree of 
conservation (Bhave et al.. 1990). In partxcular, a high y 
conserved region, also containing some of the -due 
indicated in enzyme function and effector -tenons has 
been identified (Morell et al.. 1988; du Jardin and Berhm. 1991). 
Clones of the potato tuber ADPGPP subunit genes have been 
10 isolated. These include a complete small subumt gene, 
acsembled by addition of sequences from the first exon of the 
/enomic clone with a nearly full-length cDNA clone of the same 
ge ne and an almost complete gene for the large subumt. The 
nucleotide sequence (SEQ ID NO:7) and the amino acid sequence 
15 (SEQ ID NO:8) of the assembled small subunit gene is presented 
in Figure 5. The nucleotide sequence presented here dxffers 
from the gene originally isolated in the following ways: _a 
Bg lIi + NcoI site was introduced at the ATG codon to faohtate the 
cloning of the gene into E. coli and plant expression vectors by 
20 site directed mutagenesis utilizing the oligonucleotide pnmer 

GTTgTtAACAAGATCTGTTAACCATGGCGGCTTCC (SEQ 

n) no-.11). 

A Sad site was introduced at the stop codon utxhzmg the 

25 oligonucleotide primer sequence 

CCAGTTAAAACGGAGCTCATCAGATGATGATTC (SEQ ID 

^eSacl site serves as a 3" cloning site. An internal BglU site 
was removed utilizing the oligonucleotide primer sequence 
30 QTGTGAGAACATAAATCTTGGATATGTTAC (SEQ 

NO:13). 
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This assembled gene was expressed in E. coli under the control 
of the recA promoter in a PrecA-genelOL expression cassette 
(Wong et al. t 1988) to produce measurable levels of the protein. 
An initiating methionine codon is placed by site-directed 
mutagenesis utilizing the oligonucleotide primer sequence 
GAATTCACAGGGCCATGGCTCTAGACCC (SEQ ID NO: 14) 
to express the mature gene. 

The nucleotide sequence (SEQ ED NO:9) and the amino 
acid sequence (SEQ ID NO: 10) of the almost complete large 
subunit gene is presented in Figure 6. An initiating methionine 
codon has been placed at the mature N-terminus by site-directed 
mutagenesis utilizing the oligonucleotide primer sequence 
AAGATCAAACCTGCCATGGCTTACTCTGTGATCACTACTG 
(SEQ ID NO:15). 

The purpose of the initiating methionine is to facilitate the 
expression of this large subunit gene in E. coli. A HindUI site is 
located 103 bp after the stop codon and serves as the 3* cloning 
site. The complete large ADPGPP gene is isolated by the 5* 
RACE procedure (Rapid Amplification of cDNA Ends; Frohman, 
1990; Frohman et al., 1988; Loh et al., 1989). The oligonucleotide 
primers for this procedure are as follows: 
1) GGGAATTCAAGCTTGGATCCCGKJGCCCCCCCCCCCCCCC 
(SEQ ID NO:16); 

.2) GGGAATTCAAGCTTGGATCCCGGG (SEQ ID NO:17); and 
3) CCTCTAGACAGTCGATCAGGAGCAGATGTACG (SEQ ID NO:18). 
The first two are the equivalent to the ANpolyC and the AN 
primers of Loh et al. (1989), respectively, and the third is the 
reverse complement to a sequence in the large ADPGPP gene, 
located after the Pst I site in the sequence in Figure 6. The PCR 
5* sequence products are cloned as EcoRVHindllVBamHI-Pstl 
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fragments and are easily assembled with the existing gene 
portion. 

The weakly regulated enzyme mutants of ADPGPP are 
identified by initially scoring colonies from a mutagenized E. coli 
culture that show elevated glycogen synthesis, by iodine staining 
of 24-48 hour colonies on Luria-Agar plates containing glucose at 
1%, and then by characterizing the responses of the ADPGPP 
enzymes from these isolates to the positive and negative effectors 
of this activity (Cattaneo et al.. 1969; Preiss et al., 1971). A 
similar approach is applied to the isolation of such variants of 
the plant ADPGPP enzymes. Given an expression system for 
each of the subunit genes, mutagenesis of each gene is carried 
out separately, by any of a variety of known means, both 
chemical or physical (Miller, 1972) on cultures containing the 
gene or on purified DNA. Another approach is to use a PCR 
procedure (Ehrlich, 1989) on the complete gene in the presence of 
inhibiting Mn~ ions, a condition that leads to a high rate of 
misincorporation of nucleotides. A PCR procedure may also be 
used with primers adjacent to just a specific region of the gene, 
20 and this mutagenized fragment then recloned into the non- 
mutagenized gene segments. A random synthetic oligo- 
nucleotide procedure may also be used to generate a highly 
mutagenized short region of the gene by mixing of nucleotides in 
the synthesis reaction to result in misincorporation at all 
25 positions in this region. This small region is flanked by 
restriction sites that are used to reinsert this region into the 
remainder of the gene. The resultant cultures or transformants 
are screened by the standard iodine method for those exhibiting 
glycogen levels higher than controls. Preferably this screening 
is carried out in an E. coli strain deficient only in ADPGPP 
activity (such as E. coli fcC618\ which is a spontaneous mutant of 
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LC618 (Cattaneo et al., 1969; Creuzet-Sigal et aL, 1972) that is 
phenotypically glycogen-minus and that is complemented to 
glycogen-plus by glgC. The E. coli strain should retain those 
other activities required for glycogen production. Both genes are 
expressed together in the same E. coli host by placing the genes 
on compatible plasmids with different selectable marker genes, 
and these plasmids also have similar copy numbers in the 
bacterial host to maximize heterotetramer formation. Examples 
of compatible plasmids include the pBR322/pBR327/pUC series 
(with Ampicillin selection) based on the ColEl replicon and the 
pACYC177 plasmid (with Kanamycin selection) based on the 
pl5A replicon (Chang and Cohen, 1978). The use of .separate 
plasmids enables the screening of a mutagenized population of 
one gene alone, or in conjunction with the second gene following 
transformation into a competent host expressing the other gene, 
and the screening of two mutagenized populations following the 
combining of these in the same host. Following re-isolation of 
the plasmid DNA from colonies with increased iodine st ainin g, 
the ADPGPP coding sequences are recloned into expression 
vectors, the phenotype verified, and the ADPGPP activity and its 
response to the effector molecules determined. Improved 
variants will display increased V max , reduced inhibition by the 
negative effector (Pi), or reduced dependence upon activator (3- 
PGA) for maximal activity. The assay for such improved 
characteristics involves the determination of ADPGPP activity in 
the presence of Pi at 0.045 mM (Io,s = 0.045 mM) or in the 
presence of 3-PGA at 0.075 mM (A 0 .5 = 0.075 mM). The useful 
variants will display <40% inhibition at this concentration of Pi 
or display >50% activity at this concentration of 3-PGA. 
Following the isolation of improved variants and the 
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. f fhP aubunit or subunits responsible, the 

determination of the suduiuv u 

detenni f „ m ; ned by nucleotide sequenang. The 

. end reassay of ADPGPP activity in the presence of 

e^>aUnt compUte ADPGPP cDNA gene, by redo- t J. 

V on containing the change from- the altered ba tenal 

• ft.™ to the plant form containing the amyloplast 

T ' e or by site-directed mutagenesis of the 
targeting sequence, or Dy 

complete native ADPGPP plant 




Starch biosynthesis >s known to take place m p 
» chloroplasts and amyloplasts (herein 

-plastids-. In the plants that have hsen studied the ADPg u»« 
>„,.„t,„rvlase is localized to these plastids. ADPglucose 
pyrophosphory Use .s 1 in pea shoots 

- sZtr leaves, al, of the ADPglucose 
- ^hoX,ase activity, along with the ^J^T 
.cavity is found in the chloroplasts (Mares, 1978 and Ofata, 
^, Immunocy.chemical location show, that ft. potau 
!b! ADPglucose pyrophosphorylase is found exdunvel, r-n *e 
^yioplasu (Kim, 1M Studies with rice * 
=S 8 Jw. that the ADPglucose pyrophosphorylase actmty 

.mall subunit of Ribulose-l.S-bisphosphate carboxylase 
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(ssRUBISCO, SSU), 5-enolpyruvateshikimate-3-phosphate 
synthase (EPSPS), Ferredoxin, Ferredoxin oxidoreductase, the 
Light-harvesting-complex protein I and protein II, and 
Thioredoxin F. It has been demonstrated in vivo and in vitro 
that non-chloroplast proteins may be targeted to the chloroplast 
by use of protein fusions with a CTP and that a CTP sequence is 
sufficient to target a protein to the chloroplast. Likewise, 
amyloplast-localized proteins are expressed from nuclear genes 
as precursors and are targeted to the amyloplast by an 
amyloplast transit peptide (ATP). It is further believed that the 
chloroplast and amyloplast are developed from common 
proplastids and are functionally distinct only in that the former 
is found in photosynthetic cells and the latter in non- 
photosynthetic cells. In fact, interconversion between the two 
organella has been observed in plants such as Picea abies 
(Senser, 1975). There are also reports showing that the 
amyloplast and chloroplast genomes from the same plant are 
indistinguishable (Scott, 1984; Macherel, 1985 and Catley, 1987). 
It has been further shown that an amyloplast transit peptide 
functions to import the associated polypeptide into chloroplasts 
(Klosgen, 1989). 

In the exemplary embodiments, a specialized CTP, 
derived from the ssRUBISCO 1A gene from Arabidopsis 
ihaliana (SSU 1A) (Timko, 1988) was used. This CTP (CTP1) was 
constructed by a combination of site-directed mutageneses. The 
CTP1 nucleotide sequence (SEQ ID NO:5) and the corresponding 
amino acid sequence (SEQ ED NO:6) is also shown in Figure 3. 
CTP1 is made up of the SSU 1A CTP (amino acid 1-55), the first 
23 amino acids of the mature SSU 1A protein (56-78), a serine 
residue (amino acid 79), a new segment that repeats amino acids 
50 to 56 from the CTP and the first two from the mature protein 
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•A. 80 87) and an alanine and methionine residue 

(anUn ° "f^td 89) An Ncol restriction site is located at the 
(ami - aad 88 and 89 An ^ rf 

3" end (spans the Met co g pyropho8phory lase 

^TZ sZ a B m site was introduced upstream of 

eene. At a later stage, « fo^iitate the 

e r cqtt 1A sequences to facilitate tnc 

the N-tenninus of the bt>u ^ b«h 

CTpT CTP and the *** CIS gene from A co/i to produce a 
1 P leuTtructural DNA seance encoding the , trans, 
pe'de/ADPglucose pyrophosphorylase & 
Those skilled in the art will recognize that if either 
, , ♦ ADPelucose pyrophosphorylase cDNA encoding 
single plant ADPglucose pyr P h ^pQpp cDNAs 

shrunken and/or brittle subuxuts or both plant ADPGPP 
* encoding shrunken and brittle subunits is utilized n the 
1- Af the present invention, the endogenous CTP or ATr 
^t mo a ily ald Preferably be used. Hence, for purposes of 
could most easily P ^ should 

4. P-«^ ^ transit peptides and 

be interpreted to include bo ^ ^ ^ 

amyloplast transit peptides. Those sfcwea ^ d 
\oL* that various other chimeric constructs can be ma 
recognize tnai v«n«"» ,. -i-ntid transit 

which utilize the functionality of a particular plasbd ^ 

7a ♦ import the contiguous ADPglucose pyrophosphorylase 
pepUde to import depending on 

enzyme into the plant cell P^ of ^ fusion 

25 the promoter tissue specificity, lne runcw j 
^ptfde can be confined uau* the foUowog^ 
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lonrffolia) by centrifugation in Percoll/ticoi 



WO 91/19806 



PCT/US9 1/04036 



29 



of intact chloroplasts is suspended in 0.5 ml of sterile 330 mM 
sorbitol in 50 mM Hepes-KOH, pH 7.7, assayed for chlorophyll 
(Arnon, 1949), and adjusted to the final chlorophyll 
concentration of 4 mg/ml (using sorbitol/Hepes). The yield of 
intact chloroplasts from a single head of lettuce is 3-6mg 
chlorophyll. 

A typical 300 pi uptake experiment contained 5 mM 
ATP, 8.3 mM unlabeled methionine, 322 mM sorbitol, 58.3 mM 
Hepes-KOH (pH 8.0), 50 \il reticulocyte lysate translation 
products, and intact chloroplasts from L. sativa (200 jig 
chlorophyll). The uptake mixture is gently rocked at room 
temperature (in 10 x 75 mm glass tubes) directly in front of a 
fiber optic illuminator set at maximum light intensity (150 Watt 

^ bulb). Aliquots of the uptake mix (50 nl) are removed at various 
times and fractionated over 100 ^1 silicone-oil gradients (in 150 pi 
polyethylene tubes) by centrifugation at 11,000 X g for 30 seconds 
Under these conditions, the intact chloroplasts form a pellet 
under the silicone-oil layer and the incubation medium 
(containing the reticulocyte lysate) floats on the. surface. After 
centrifugation, the silicone-oil gradients are immediately frozen 
in dry ice. The chloroplast pellet is then resuspended in 50-100 
\d of lysis buffer (10 mM Hepes-KOH pH 7.5, 1 mM PMSF, 1 mM 
benzamidine, 5 mM e-amino-n-caproic acid, and 30 iig/ml 

25 aprotinin) and centrifuged at 15,000 X g for 20 minutes to pellet 
the thylakoid membranes. The clear supernatant (stromal 
proteins) from this spin, and an aliquot of the reticulocyte lysate 
incubation medium from each uptake experiment, are mixed 
with an equal volume of 2X NaDodS04-PAGE sample buffer for 

30 electrophoresis (see below). 

SDS-PAGE is carried out according to Laemmli (1970) 
in 3-17% (w/v) acrylamide slab gels (60 mm X 1.5 mm) with 3% 
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WW acry.an.ide stacking gel. « — X " ™* ™" ^' ^ 

(w/v) aery cftlut ion with 40% methanol and 10* 

dryer. The gel overnight exposure to determine 

intensifying screen and an overmg ^ ^ 
whether the ADPglucose pyrophosphorylase unpo 

i30lat6d "^ive means for enhancing ADPglucose levels " 
• , t^ell S be to isolate genes encoding transcription 
in plant cells win uc „ ml utnTv elements of 

JLn which interact with the upstream ***** 
^ piant ADPglucose pyrophosphorylase ■"^^ 

■ r .>,.«. transcription factors in plant cells can caun 

^nnd" conditions, the increased starch content ,. 
gene. Under these co ADPglucose 

^r^^-^ --^^ mechani sm is 
pyT ophosphorylase ^^Xlscription factors have 
different. Methods for the isolation of transcnp 
been described (Katagiri, 1989). 

i. noTj aline synthase (NOS) gene, ^ 

plasmid genes, such as the nopaiine y» ^ 

, *. Hke the soybean storage protein genes au 

(2) plant genes like tne soy carboxylase 
30 8m all subunit of the ribulose-1 5- ^ ^ 
(ssRUBISCO) gene. An-e*ample of a preferred 3 regi 
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from the NOS gene, described in greater detail in the examples 
below. 

Plant Transformation /Regeneration 

Plants which can be made to have increased starch 
content by practice of the present invention include, but are not 
limited to, corn, wheat, rice, carrot, onion, pea, tomato, potato, 
sweet potato, peanut, canola/oilseed rape, barley, sorghum, 
cassava, banana, soybean, lettuce, apple and walnut. 

A double-stranded DNA molecule of the present 
invention containing the functional plant ADPglucose 
pyrophosphorylase gene can be inserted into the genome of a 
plant by any suitable method. Suitable plant transformation 
vectors include those derived from a Ti plasmid of 
Agrobacterium tumefaciens, as well as those disclosed, e.g., by 
Herrera-Estrella (1983), Bevan (1983), Klee (1985) and EPO 
publication 120,516 (Schilperoort et al.). In addition to plant 
transformation vectors derived from the Ti or root-inducing (Ri) 
plasmids of Agrobacterium , alternative methods can be used to 
insert the DNA constructs of this invention into plant cells. 
Such methods may involve, for example, the use of liposomes, 
electroporation, chemicals that increase free DNA uptake, free 
DNA delivery via microprojectile bombardment, and 
transformation using viruses or pollen. 

A plasmid expression vector, suitable for the expression 
of the E. coli glgCl6 and other ADPGPP genes in monocots is 
composed of the following: a prompter that is specific or 
enhanced for expression in the starch storage tissues in 
monocots, generally the endosperm, such as promoters for the 
zein genes found in the maize endosperm (Pedersen et al., 1982); 
an intron that provides a-spiice site to facilitate expression of the 
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„ the ADH1 intron (Callas et al., 1987); and a 3 
gene, such as the ADH1 ^ 3 

polyadenylation sequence s expreBS ion cassette 

tra nsformati.n vect°r ^ ^ S.G., 

dicotyledonous plants is pia derivative of 

198V , Plasmid PMON530 (see » ^ 

P MON505 prepared by trWerrmg ^ ^ 26 

lament of P MON316 ^^ rf ^ f pMON5 05 in which 
Hasmid pMON526 is a ample denvatm- op ^ 

Sma! -U is -^^r^^N 5 30 retain. aU 
Klenow polymerase ana gr CaMV3 5S-NOS expression 

- - erti r:::" : ~ * f - smI 
r^- -jsstrtss. o t pmok M o 

r^St^plasnndho^on 
(Rogers. S.G.. > Hindin to Smal segment of 

UH. has been ^^^^ » Haling W* 
*" UB, 'T l! fm3. of replication. oriV. «k1 

Tbi. ««•-» -7" * ^c n^«on into A**-**» 
,he origin of transfer. onT for con, ga ^ 

^thetri-paren^ma^P-*- ^ ^ of 
5 PLsmid PMON505 retama all the P^ ^ rf 

P MON200 induding the j^UrMOS gene 

desired DNA fragments, the com «. «^ 3pMtm0 . 
for kanamycin resistance » plant » . n 
.ycin/streptromyan ""^^thase gene for 
» E. coli and A. mmtfacens, •» n ° ' and 
facile scoring of transformants and mhenUnce 
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a pBR322 origin of replication for ease in making large amounts 
of the vector in E. co/i. Plasmid pMON505 contains a single T- 
DNA border derived from the right end of the pTiT37 nopaline- 
type T-DNA. Southern analyses have shown that plasmid 
pMON505 and any DNA that it carries are integrated into the 
plant genome, that is, the entire plasmid is the T-DNA that is 
inserted into the plant genome. One end of the integrated DNA 
is located between the right border sequence and the nopaline 
synthase gene and the other end is between the border sequence 
and the pBR322 sequences. 

When adequate numbers of cells (or protoplasts) 
containing the ADPglucose pyrophosphorylase gene or cDNA 
are obtained, the cells (or protoplasts) are regenerated into whole 
plants. Choice of methodology for the regeneration step is not 
critical, with suitable protocols being available for hosts from 
Leguminosae (alfalfa, soybean, clover, etc), Umbelliferae 
(carrot, celery, parsnip), Cruciferae (cabbage, radish, rapeseed, 
etc.), Cucurbitaceae (melons and cucumber), Gramineae 
(wheat, rice, corn, etc.), Solanaceae (potato, tobacco, tomato, 
peppers) and various floral crops. See, e.g., Ammirato (1984); 
Shimamoto, 1989; Fromm, 1990; Vasil, 1990. 

The following examples are provided to better elucidate 
the practice of the present invention and should not be 
interpreted in any way to limit the scope of the present invention. 
Those skilled in the art will recognize that various 
modifications, truncations, etc. can be made to the methods and 
genes described herein while not departing from the spirit and 
scope of the present invention. 
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EiaDmk i express the E. coii gig CIS 6 ene in plant cells, and to 
target the enzyme to the plastids. the gene needed *> be fused » a 
D ?A encoding the plastid-taxgeting transit peptide (hereinafter 
referred to as the CTP/ADPglucoee pyrophosphatase gene), 
and to the proper plant regulatory regions. Th,s wa. 
accomplished by cloning the gig C16 gene int. a sene, 
plasmid vectors that contained the needed sequences 

The plasmid pLP226 contains the gig CIS gene on a 
Hindi fragment, cloned into a pUCS vector at the Hindi site 
aelg et ^986,. pU226 was obtained from Dr. Jack Pre*, at 
Michigan State University, and was tran.fom.ed into froxen 
^patent E. coll JM101 cells, prepared by the calctum chlonde 
me thod (Samhrook et al.. 1989). The transformed ceH, . were 
pUted on 2XYT (infra) plates that contained amptcubn at 100 
The pUsmid P LP226 was purified by the rap,d ^e 
extraction procedure (KAE) from a 5 ml overmght culture 

(Birnboim and Doly 1979). . 

To fuse the gig CIS gene to the DNA encoding the 
chloroplast transit peptide, a Ncol site was needed at the 5 end 
rfthe gene. A Sad site downstream of the Unmnafon codon 
It also needed to move the CTP/ADPgl«c..e 
pyrophosphatase gene into the next vector. In order to 
LTroduce these sites, a PCK reaction (.13) was run usvng 
approximately 20 ng of rapid alkaline "tractton-punfied 
pLsmid PLP226 for a template. The react™-.* se^up 
o.lowing the recommendations of the 

Elmer Cetus). The primers were QSP3 andQS P . QSP3 was 
30 designed to introduce the Ncol site that would mdude the start 
eoTL the gig C 16 gene. The QSP7 primer hybridixed ,n the 
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3' nontranslated region of the gig C16 gene and added a Sad 
site. The Thermal Cycler was programmed for 30 cycles with a 1 
minute 94°C denaturation step, a 2 minute 50°C annealing step, 
and a 3 minute 72°C extension step. After each cycle, the 
extension step was increased by 15 seconds. 

QSP3 Primer. 

5 f -GGAGTTAGCCATGGTTAGTTTAGAG-3' (SEQ ID NO:19) 



10 



QSP7 Primer: 

5'-GGCCGAGCTCGTCAACGCCGTCTGCGATTTGTGC-3' 
(SEQ ID NO;20) 



The vector that the PCR product was cloned into was 
pGEM3zf+ (obtained from Promega, Madison, WI) that had been 
digested with SacI and Hind III, and had the DNA for the 
modified Arabidopsis small subunit CTP1 ligated at the HindlTT 
site. The DNA (SEQ ID NO:5) and amino acid sequence (SEQ ID 
NO:6) of this CTP1 are shown in Figure 3. 

The linearized vector was treated with 5 units of calf 
intestinal alkaline phosphatase for 30 minutes at 56*C. Then, 
both the vector and the PCR #13 fragment, which had the gig 
C16 gene with the new Ncol and SacI sites, were ran on an 
agarose gel and the fragments were purified by binding to DEAE 
membranes. The protocol used for the fragment purification 
with the DEAE membrane is from Schleicher and Schuell, and 
is titled "Binding and Recovery of DNA and RNA Using S and S 
DEAE Membrane/' 

Ligation #5 fused the gig C16 gene to the DNA for the 
modified Arabidopsis SSU CTP with the pGEM3zf+. The ligation 
contained 3 ill of vector that had been digested with Ncol and 
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Sad, along with 3 u, of *e PGR .13 product, that ^Uo been 
cut with Ncol and Sad and repurified on a gel. 5 pi W ^0»1 
oul, of ligation .5 was transformed into 
MM1 cells, and the transformed cells were plated on 2XYT 
pUte. (U Barto-.ryp.onO. 10 tfl yeast extract. !0 grt Nad. pH 
7 3 and solidified with 1.5* agar) containing ampmlun. 

Sample 1 was picked from a plate •«» «"™* t 
growth. This sample was inoculated into 4 ml of medm 
L grown overnight at 37-C. The plasmid was .solated by *e 
° tid alpine extraction procedure, and the DNA was 

Tth EcoRI. N-l. and EcoRI and Ncol together. The d,gest was 
grated on an agarose gel. and the expected ^« 
oblved. The plasmid isolated from sample ! was des.gn.Ud 
pMONMiOO. and consisted of pGEM3*f+. the DNA or the 
* modified Matidop.is SSU CTP. and the * C« 8-e. The 
fcaion was in the orientation that allowed it to be transcnbed 
from the SP6 polymerase promoter. Mjpducose 
To test this construct for import of the ADPglucos 
pyrophosphorylase into isolated lettuce 
50 CTP/ADPglucose pyrophosphorylase fusion needed to 

™beu and translaUd to produce ^1-Ubeled^-se 
pyrophosphorylase. To make a DNA template for 
bythe SP6 polymerase, the CTP/ADPglucose pyrophosphorylase 
region of pMON20100 was amplified by PCR to generate AW 
* amount of linear DNA. To do this, about 0.1 ul of pMON2 0 m 
mat had been purified by rapid alkahne extract was used as 
a template in PCR reaction .80. The pnmer. were 
commercially available SP6 promoter primer 
„Ug„ QSP7. The SP6 primer hybridized to the SP6 promoter -n 
» I* vector, and included the entire SP6 promoter seouen^ 
Therefore, a PCR product primed with this ohgonucleoUde vnll 
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contain the recognition sequence for the SP6 polymerase. The 
QSP7 primer will hybridize in the 3' nontranslated region of the 
gig C16 gene. This is the same primer that was used to 
introduce a Sad site downstream of the gig C16 termination 
codon. The Thermal Cycler was programmed for 30 cycles with 
a 1 minute denaturation at 94°C, a 2 minute annealing at 55°C, 
and a 3 minute extension at 72°C. After each cycle, 15 seconds 
were added to the extension step. 

SP6 Promoter Primer: 

5'-GATTTAGGTGACACTATAG-3' (SEQ ID NO:21) 

5 ul of PCR reaction #80 was run on an agarose gel and 
purified by binding to DEAE membrane. The DNA was eluted 
and dissolved in 20 uJ of TE. 2ul of the gel-purified PCR #80 
product was used in an SP6 RNA polymerase in vitro 
transcription reaction. The reaction conditions were those 
described by the supplier (Promega) for the synthesis of large 
amounts of RNA (100 ul reaction). The RNA produced from the 
PCR reaction #80 DNA was used for in vitro translation with the 
rabbit reticulocyte lysate system (Promega). 3SS-labeled protein 
made from pMON20100 (ie:PCR reaction* 80) was used for an in 
vitro chloroplast import assay as previously described. After 
processing the samples from the chloroplast import assay, the 
samples were subjected to electrophoresis on SDS-PAGE gels 
with a 3-17% polyacrylamide gradient. The gel was fixed for 
20-30 minutes in a solution with 40% methanol and 10% acetic 
acid. Then, the gel was soaked in EN3HANCE™ for 20-30 
minutes, followed by drying the gel on a gel dryer. The gel was 
imaged by autoradiography, using an intensifying screen and 
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an overnight exposure. The re 9 ulta demonstrated that the 
fusion protein was imported into the isolated chloroplasts 

The construct in pMON20100 was next engineered to be 
fused to the En-CaMV35S promoter (Kay, R. 1987) and the NOS 
3- end (Bevan. M. 1983) isolated from pMON999. PCR reaction 
114 contained plasmid pMON 20100 as a template, and used 
primers QSMll and QSM10. QSMll annealed to the DNA for the 
modified Arabidopsis SSU CTP and created a Bglll sate 7 bp 
upstream from the ATG start codon. QSM10 annealed to the 
3- end of the gig C16 gene and added an Xbal site immediately 
after the termination codon, and added a Sad site 5 bp after the 
termination codon. The Sad site that had earlier been added to 
the gig CIS gene was approximately 100 bp downstream of the 
termination codon. The Thermal Cycler was programmed for 25 
15 cycles with a 1 minute 94°C denaturation, a 2 minute 55°C 
annealing, and a 3 minute 72«C extension step. With each cycle, 
15 seconds was added to the extension step. 

QSMll Primer. 

20 5*-AGAGAGATCTAGAACAATGGCTTCCTCTATGCTCTCTTCCGC-3 



25 



(SEQ ID N0.22) 

QSM10 Primer. 
5*-GGO 
NO:23) 



QSM10 Primer. 

5*-GGCCGAGCTCTAGATTATCGCTCCTG3TTATGCCCTAAC-3 (SEQ ID 



Ninety-five (95)ul (from 100 ul total volume) of PCR 
reaction #114 was ethanol precipitated, and resuspended in 20 ul 
of TE Five (5) ul of this was digested with Bglll (4 units) and 
30 Sad (10 units) overnight at 37»C. Five (5) ul (5 ug) of the vector. 

DMON999. which contains the En-CaMV35S promoter and the 



WO 91/19806 



PCT7US9 1/04036 



59 

NOS 3' end, was digested in the same manner. After digestion 
with the restriction enzymes, the DNAs were run on an agarose 
gel and purified by binding to DEAE membranes. Each of the 
DNAs were dissolved in 20 \d of TE. One (1) ^1 of PCR 114 was 
ligated with 3 |il of the vector, in a total volume of 20 pd. The 
ligation mixture was incubated at 14°C for 7 hours. Ten (10) pi of 
the ligation was transformed into frozen competent MM294 cells 
and plated on LB plates (10 g/1 Bacto-tryptone, 5 g/1 yeast extract, 
10 g/1 NaCl, and 1.5% agar to solidify) with 100 (ig/ml ampicillin. 
Colonies were picked and inoculated into tubes with 5 ml of LB 
media with 100 (ag/ml ampicillin, for overnight growth. The 5 
ml overnight cultures were used for rapid alkaline extractions to 
isolate the plasmid DNAs. The DNAs were digested with EcoRI, 
and separate aliquots were digested with Notl. After analyzing 
these samples on agarose gels, the plasmid pMON20102 was 
confirmed to have the 497 bp EcoRI fragment that is 
characteristic of the gig C16 gene. This plasmid also contained 
the 2.5 kb Notl fragment which contained the En-CaMV35S 
promoter, the DNA for the modified Arabidopsis SSU CTP, the 
gig C16 gene, and the NOS 3' end. 

The 2.5 kb Notl cassette was then transferred into a 
plant transformation vector, pMON530 (Figure 4). pMON530 
contains a unique Notl site in the RK2 region, exactly 600 bp 
after the Hindlll site. A description of the construction of 
pMON530 can be found in Rogers et aL, 1987. Twenty (20) \ig of 
pMON530 was digested with 40 units of Notl overnight at 37°C. 
The digested vector was then dephosphorylated with 22 units of 
calf alkaline intestinal phosphatase at 37°C for about 1 hour. 
The pMON530 vector was extracted with phenol/chloroform, 
then chloroform, and was ethanol precipitated. Ten (10) \ig of 
plasmid pMON20102 was also digested overnight at 37°C with 40 
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fNotl TT.eNotl-digesUdpMONSaO vector was ligated to 
units of NotL The Not s Q102 at l5 o C overnight. 

^ HotI cassette from P^^~ compete »t JM101 
The ligation ^V^ rmed ^s were plated on 1^ with 75 
E. coli cells, and the transformed cells we 

tig/ml spectinomycin. transformation 
Nine colonies were picked from the ir 

• * ml LB cultures for screening. Plasmids 
plate and grown m 5 ml LB cult 
from 5 ml cultures were prepared b _ the ^ P 
extraction procedure. The DNAs were f£ e n y 
0 digestions which were separated o» l . 1% * ' * 

— "^^0"^ oL-t wis isolated, 
parent plasmid, P MON530 the ^ and ^ orientati on 

The construct was designated PM™< 
determined by PstI digestion and Ncoimgl I 
IS „ „ r.MVlRS promoter dnving the ^ir"*~ e 

The En-CaMV35S pr orie ntation as the 

pvrophosphory,ase gene .s m the sa „. 
CaMV35S promoter that was already present _ P 

I„ preparation for transform** " ba ^ 

^wn overnight in kanamycn (50 ug/ml>. ^ 

started with several =°>-« f ™ ^ epecti nomvcin. 
25 pMON20104 was grown in LB wth 75 pg/rnl ^ ^ ^ 

Ate all of the cultures were grown 4 ml of LB ^ 
tube with 100 |d each of A**-*— ■ ^ J for 5 

minutes and decanted. The peiie 
» remaining H,uid, and pipetted into the „ 
MU r overnight gn.wth"at 30"C, a loop of ceUs from Uns 
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streaked onto an LB plate with 75 \xg/ml spectinomycin and 25 
pig/ml chloramphenicol. 

After 1-2 days at 30°C, the plate from the triparental 
mating of pMON20104, Agrobacterium ASE, and pRK2013, had 
growing colonies, while the control plate from the mating of 
pMON20104 and ASE (without pRK2013, which is needed for 
mobilization) did not. After the triparental mating, 2 colonies 
were picked from the plate, inoculated into a liquid culture with 
75 \xg/m\ spectinomycin, 25 ^ig/ml chloramphenicol, and 50 
pg/ml kanamycin, and grown at 30°C. These two cultures were 
used for transformation into tobacco. 

The tobacco leaf disc transformation protocol uses 
healthy leaf tissue about 1 month old. After a 15-20 minute 
surface sterilization with 10% Clorox plus a surfactant, the 
leaves were rinsed 3 times in sterile water. Using a sterile paper 
punch, leaf discs are punched and placed upside down on MS104 
media (MS salts 4.3 g/1, sucrose 30 g/1, B5 vitamins 500X 2 ml/1, 
NAA 0.1 mg/1, and BA 1.0 mg/1) for a 1 day preculture. 

The discs were then incolated with an overnight culture 
of Agrobacterium ASE:pMON20104 that had been diluted 1/5 (ie: 
about 0.6 OD). The inoculation was done by placing the discs in 
centrifuge tubes with the culture. After 30 to 60 seconds, the 
liquid was drained off and the discs* were blotted between sterile 
filter paper. The discs were then placed upside down on MS 104 
feeder plates with a filter disc to co-culture. 

After 2-3 days of co-culture, the discs were transferred, 
still upside down, to selection plates with MS104 media. After 2- 
3 weeks, callus formed, and individual clumps were separated 
from the leaf discs. Shoots were cleanly cut from the callus 
when they were large enough to distinguish from stems. The 
shoots were placed 6n~hormone-free rooting media (MSO: MS 
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salt8 4 3 g/1, sucrose 30 g/1. and B5 vitamins 500X 2 ml/1) with 
selection Roots formed in 1-2 weeks. Rooted shoots were placed 
in soil and were kept in a high humidity environment (xe: plastic 
containers or bags). The shoots were hardened off by gradually 
1 exposing them to ambient humidity conditions. 

Starch levels of transformed callus tissue was 
qoantitatad by a modification of the procedure of Lin et al. (Lin et 
al 1988a) Clumps of callus were removed from their plates, 
taking care not to include any agar. The callus was put into 1.5 
10 ml microcentrifuge tubes and dried under a vacuum in a SPEED 
VAC™ (Savant). After several hours of drying, the tubes were 
removed and weighed on an analytical balance to the closest 0.1 
mg The tubes were returned to the SPEED VAC™ for several 
more hours, then were reweighed to determine if a stable dry 
15 weight had been obtained. The dried callus was ground m the 
tube and thoroughly mixed, to give a homogenous sample. An 
aliquot of each dried callus sample was removed and put into a 
preweighed 1.5 ml microcentrifuge tube. These new tubes were 
then reweighed. and the weight of the calli samples in them was 
20 determined. The samples ranged from 9 to 34 mg. 

Approximately 1 ml of 80% ethanol was added to each 
tube, and the tubes were incubated in a 70»C water bath for 10-20 
minutes. The samples were then spun down, and the ethanol 
was removed. The. ethanol wash was done 2 more tunes After 
25 the last ethanol wash, the samples were dried in a Speed Vac . 
then 200 ul of 0.2 N KOH was added to each tube. The samples 
were ground using an overhead stirrer, then the samples were 
heated at 100'C for 30 minutes. Before heating the tubes, several 
small holes were made in the caps with a needle. This 
30 prevented the caps from popping off and causing a loss of 
sample. After the heating step. 40 ul of IN acetic acid was added 
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to each sample. 35 ul (7.4 units) of pancreatic alpha-amylase 
was added, followed by a 30 minute incubation at 37°C. Next, 5 
units (in 5 ul) amyloglucosidase (from Aspergillus niger) was 
added to each sample, along with 160 \il of 100 mM sodium 
acetate pH. 4.6. The samples were heated to 55°C for 1 hour, 
boiled for 2-3 minutes, and briefly spun down in a 
microcentrifuge. At this point, the samples were again dried in 
a Speed Vac™, and were resuspended in 1000 ul of 100 mM 
Tris-Cl pH 7.5. 

The samples were then assayed for glucose using the 
Glucose [HK] assay from Sigma (catalogue # 16-10). Using this 
assay, glucose in the samples (+ATP) is converted to 
glucose-6-phosphate + ADP by hexokinase. The glucose-6- 
^ phosphate (+NAD) is converted to 6-phosphogluconate + NADH. 
The increase in absorbance at 340 nm, due to NADH, is 
measured and is directly proportional to the glucose 
concentration. All assays and calculations were done as 
recommended by Sigma. The assays were conducted following 
Sigma's "Alternate Procedure," at room temperature with 10 ul 
of sample per assay, or 5 ul of sample + 5 ul of lOOmM Tris-Cl pH 
7.5. The percent starch was determined by dividing the amount 
(weight) of glucose by the dry weight of the callus. 

For the Western blots, a portion of the dried, 
22 homogenized callus from each of the 12 samples, plus the 2 
control samples, was resuspended in 200 ul of extraction buffer 
(100 mM Tris-Cl pH 7.1, 1 mM EDTA, 10% glycerol, 5 mM DTT, 1 
mM benzamidine). Each sample was ground with an overhead 
stirrer, spun in a microcentrifuge for 5 minutes at full speed, 
and the supernatants were removed to new tubes. The protein 
concentration in each sample was determined by. the BioRad 
protein assay (Lowry et al. 1951), with BSA as a standard. 
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Twenty-five (25) ug of each sample was loaded onto SDS 
polyacrylamide ge|s. with a 7-17% polyacrylamide gradient. 
Since the samples were loaded onto two gels, the same control 
callus sample was loaded onto each gel. In addition, a control 
spiked with 10 ng of pure E. coli ADPglucose pyrophosphorylase 

was loaded onto each gel. 

After electrophoresis, the gels were blotted to 
nitrocellulose using a PolyBlot™ apparatus from American 
Bionetics. The Western blots were processed according to the 
protocol provided by Promega. The filters were blocked with 1% 
BSA in TBST (10 mM Tris-Cl pH 8.0, 150 mM NaCl, and 0.05% 
Tween 20). for 30 minutes. Ten (10.0) ml of TBST plus 1.3 ul of 
the primary rabbit anti-E. coli ADPglucose pyrophosphorylase 
antibody were mixed, and the filters was incubated with this 
primary antibody for 30 minutes. The filters were then washed 3 
times with about 50 ml of TBST per wash, for 3 washes of 5 
minutes each. Ten (10.0) ml of TBST plus 1.3 ul of the secondary 
antibody (goat-anti-rabbit conjugated to alkaline phosphatase, 
Promega) was incubated with the filters for 30 minutes followed 
again by 3 TBST washes. The signals were visualized using the 
reaction of alkaline phosphatase with BCIP and NBT. and they 
were quantitated with a laser densitometer. 
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Results: 

1 r\tv 



2 
3 
4 
5 

6 

Control 2 + 10 ng 

7 
8 
9 



26.9% 
4.6 
6.4 
12.3 
15.3 
11.1 



5.5 
5.6 
9.7 



15 10 .6.6 



11 
12 

Control 2 + 10 ng 



11.4 
13.3 



0.573 
0.170 
0.0 
0344 
0.376 
0314 
0369 

ND 

0.117 

0.095 

0.0 

0376 

0342 

0329 



20 Control 1 3.0 

Control 2 3 7 

' The spiked samples were only used on the Western biota. 

ND = not determined 



The above results show the results of the quantitative 
starch assays and the integrated peak areas from the Western 
blot*. The % Starch is reported as the percent of starch relative 
to the dry weight of the callus. The peak area is the integrated 
area under the peak from a densitometer scan of the 
corresponding sample on a Western blot. Samples 1-6 were run 
on one gel, and samples 7-12 were run on another gel. Control 2 
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was run on both blots with and without 10 ng of purified E. coli 
ADPglucose pyrophosphorylase. The unspiked samples on both 
gels showed no interfering bands. The spiked samples had the 
peak areas shown. These results demonstrate that increased 
APDglucose leads to increased starch content in plant cells. 

fiyamnle 2 t . , ^ 

pMON20104, as described in Example 1, has also been 

transformed into the Desiree potato strain using the published 
tuber disc transformation protocol of Sheennan and Bevan 
(Sheennan and Bevan 1988). Virus-free tubers of Sblanum 
tuberosum var. Desiree, were peeled, washed briefly in distilled 
water and surface sterilized for 15 minutes in 10% sodium 
hypochlorite which contained a few drops of Tween 20. The 
tubers were washed 6 times in sterile water, then were 
immersed in liquid MS medium. A sterile 1 cm diameter cork 
borer was used to remove sections of the tubers, and these 
sections were then cut with a scalpel into 1-2 mm discs. The 
discs were floated in 20 ml of MS medium containing 
20 Agrobacterium ASE:pMON20104. A 10 ml culture of 
Agrobacterium ASE:pMON20104 was spun down and 
resuspended in 20 ml of MS medium before use. The culture 
and the discs were gently shaken in a petri dish. After 20 
minutes, the discs were transferred to tobacco feeder plates with 
3C5ZR medium (MS salts, 1 mg/1 Thiamine HC1. 0.5 mgA 
nicotinic arid, 0.5 mgA pyridoxine HCL, 3% sucrose, 5 uM zeatin 
riboside, and 3 uM IAA aspartic arid, pH 5.9). 

After 48 hours, infected discs were put on the new 
plates with the same medium, but without the feeder layer, and 
30 with 500 ug/ml carbenicillin and 100 ug/ml kanamycm. The 
plates were sealed with parafilm and incubated at 25»C with 16 
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hours of light/day. The discs were subcultured onto fresh plates 
every 3 weeks, and the carbenicillin concentration was lowered 
from 500 to 200 \xg/xal after 4 weeks in culture. Developing shoots 
were removed and placed in large test tubes containing MS salts 
and R3 vitamins (1 mg/1 Thiamine HC1, 0.5 mg/1 nicotinic acid, 
0.5 mg/1 pyridoxine HC1) plus 200 ^g/ml carbenicillin and 100 
|ig/ml kanamycin. After roots have formed, the plants are 
transferred to soil and are gradually hardened off. 

These preliminary experiments demonstrate that 
recovering transgenic plants expressing the ADPGPP gene 
under the control of the En-CaMV35S promoter is problematic. 
One potato plant was produced on a sucrose containing medium, 
but when removed from the medium and placed in soil, it did not 
survive. This result is not unexpected. The En-CaMV35S 
promoter is a constitutive promoter and causes expression of the 
ADPGPP in all tissues of the plant. The constitutive expression 
of the ADPGPP gene most likely causes a deprivation of the 
sucrose supply to the growing parts of the plant due to the 
ADPGPP mediated conversion of sucrose to starch in the sugar 
exporting cells and tissues of the plant. Thus, this example 
illustrates the expression of ADPGPP in plant cells and the 
preference, in most cases, that the ADPGPP be expressed 
specifically in the target tissue, such as the tuber of a potato or 
the fruit of a tomato. One of ordinary skill in the art would be 
able to select from a pool of plants transformed with the En- 
CaMV35S promoter, a plant expressing ADPGPP within the 
desired range. 

Example 3 

Potato tissue has also been transformed to express a 
CTP/ADPglucose pyrop"hosphorylase fusion polypeptide driven 
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by a patatin promoter. This construct causes specific expression 
of the ADPGPP in potato tubers and increases the level of starch 
in thetubers. 

The vector used in the potato transformation is a 
derivative of the Agrobacterium mediated plant transformation 
vector 0MON886. The pMON886 plasmid is made up of the 
following well characterized segments of DNA. A 0.93 kb 
fragment isolated from transposon Tn7 which encodes bacterial 
spectinomycin/streptomycin (Spc/Str) resistance and is a 
° determinant for selection in E. coli and Agrobactertum 
tumefaciens (Fling et al., 1985). This is joined to a chimeric 
kanamycin resistance gene engineered for plant expression to 
allow selection of the transformed tissue. The chimeric gene 
consists of the 0.35 kb cauliflower mosaic virus 35S promoter (P- 
15 35S)(Odelletal..l985). the 0.83 kb neomycin phosphotransferase 
typell gene (NPTII). and the 0.26 kb S'-non-translated region of 
Z nopaline synthase gene (NOS 3') (Fraley et al.. 1983). The 
next segment is a 0.75 kb origin of replication from the RX2 
plasmid (ori-V) (Stalker et al., 1981). It is joined to a 3.1 kb Sail 
20 to Pvul segment of pBR322 which provides the origin of 
replication for maintenance in E. coli (ori-322) and the bom site 
for the conjugation^ transfer into the Agrobacterium 
tumefaciens cells. Next is a 0.36 kb Pvul fragment from the 
p TiT37 plasmid which, contains the nopaline-type T-DNA right 

25 border region (Fraley etal., 1985). 

The glgC16 gene was engineered for expression 
primarily in the tuber by placing the gene under the control of a 
tuber-specific promoter. The GlgC16 protein was directed to the 
plastids within the plant cell due to its synthesis as a C-termmal 
30 fusion with a N-terminal protein portion encoding a chloroplast 
targeting sequence (CTP) derived from that from the SSU 1A 
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gene from Arabidopsis thaliana (Timko et al. f 1989). The CTP 
portion is removed during the import process to liberate the 
GlgCl6 enzyme. Other plant expression signals also include the 
3' polyadenylation sequences which are provided by the NOS 3' 
sequences located downstream from the coding portion of the 
expression cassette. This cassette was assembled as follows: 
The patatin promoter was excised from the pBI24L3 plasmid as 
a HindllLBamHI fragment (The pBI241.3 plasmid contains the 
patatin- 1 promoter segment comprising from the AccI site at 
1323 to the Dral site at 2289 [positions refer to the sequence in 
Bevan et aL, 1986] with a HindlTL linker added at the former and 
a BarnUI linker added at the latter position; Bevan et al. f 1986) 
and ligated together with the CTPl-glgCl6 fusion (the BgllhSacl 
fragment from pMON20102 - see Example 1) and pUC-type 
plasmid vector cut with HindHI and SacI (these cloning sites in 
the vector are flanked by NotI recognition sites). The cassette 
was then introduced, as a NotI site in pMON886 t such that the 
expression of the glgC16 gene is in the same orientation as that 
of the NPTII (kanamycin) gene. This derivative is pMON20113 
which is illustrated in Figure 7. 

The pMON20113 vector was mobilized into disarmed 
Agrobacteriurn tumefaciens strain by the triparental 
conjugation system using the helper plasmid pRK2013 (Ditta et 
ah, 1980). The disarmed strain ABI was used t carrying a Ti 
plasmid which was disarmed by removing the phytohormone 
genes responsible for crown gall disease. The ABI strain is the 
A208 Agrobacteriurn tumefaciens carrying the disarmed pTiC58 
plasmid pMP90RK (Koncz and Schell, 1986). The disarmed Ti 
plasmid provides the trfA gene functions required for 
autonomous replication of the pMON vector after the 
conuugation into the ABI strain; When the plant tissue is 
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incubated with the ABI::pMON conjugate, the vector is 
transferred to the plant cells b, the vir funcuons encoded by the 

disarmed pMP90RK Ti plasmid. ., ■ ,„ 

The pMON20113 construct is then transformed uito the 
Russet Burbank potato variety. To transform Ru.se. Burbank 
potatoes, sterile shoot culture, of Russet Burbank are 
ILained in sundae cup. containing 8 ml of PM medium 
augmented with 25 mg/L ascorbic acid (MurasMge and Skoog 
MS) inorganic salts. 30 g/1 sucrose. 0.17 * NaH^O. 0.4 
' mg 1 thiamine-HCl, and 100 mgA myo-inositol. solidified w.th 2 
gA Gelrite at pH 6.0). When shoots reach approximately 5 cm m 
fength stem internode segments of 3-5 mm are excsed and 
novated with a 1:10 dilution of an overnight culture . 
^Cerium ****** from a 4 day old plate cutare. The 
6 Zn explants are co-cultured for 2 days at 20-C on a stenl fcUUr 
Daper plKed over 1.5 ml of a tobacco cell feeder layer overhud on 
VlO P medium (W0 strength MS inorganic salts and orgamc 
addenda without casein a. in Jarret «t al. (1980,. 30 gA sucrose 
and 8 0 g/1 agar). Following co-culture, the explants are 
» ^ats e JdVfull strong P.! medium for caUu. inducUon. 
composed of MS inorganic salts, organic addmon. a. m Jarret et 
rfwSO). with th. exception of ce«uv. 5.0 mgA xea*. nbo..de 
7™ and 0 10 mgA naphthaleneacetic acid KM (Jarret et al 
ZZ 80b, clnialun<5O0mgA>andcefo^ea00m^ 
25 .« included to inhibit bacterial growth, and 100 mgA 
kanamycin is added to select for transformed cells. 
TransfoLed potato plants expressing the patatin promote^ 
CTP/ADPglucose pyrophosphorylase - NOS gene show an 
increased starch content in the tuber. 
* After 4 weeks, the explants are transferred to med.um 

of the same composition, but with 0.3 mgA gibberellic aad (GA3, 
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replacing the NAA (Jarret et al. t 1981) to promote shoot 
formation. Shoots begin to develop approximately 2 weeks after 
transfer to shoot induction medium. These shoots are excised 
and transferred to vials of PM medium for rooting. After about 4 
weeks on the rooting medium, the plants are transferred to soil 
and are gradually hardened off. Shoots are tested for kanamycin 
resistance conferred by the enzyme neomycin phospho- 
transferase II, by placing the shoots on PM medium for rooting, 
which contains 50 mg/L kanamycin, to select for transformed 
cells. 

Russet Burbank Williams plants regenerated in culture 
were transplanted into 6 inch (-15.24 cm) pots and were grown to 
maturity under greenhouse condtions. Tubers were harvested 
and were allowed to suberize at room temperature for two days. 
All tubers greater than 2 cm. in length were collected and stored 
at 9°C under high humidity. 

Specific gravity (SG) was determined 3 days after 
harvest for the largest 2 or 3 tubers from each plant, with typical 
weights being 20-40 grams per tuber. Specific gravity 
calculations were performed by the weight in air less weight in 
water method, where SG = weight in air/( weight in air - weight 
in water). Calculations for percent starch and percent dry 
matter based on SG were according to the following formulas 
(von Scheelem, 1937): 

% starch = 17.546 + (199.07XSG - 1.0988) 

% dry matter = 24.182 + (21L04XSG - 1.0988). 

Western blot analysis was performed on protein 
extracted from fresh, center sections of tuber tissue as described 
for tomato leaf tissue. Starch analysis was performed on similar 
fresh tuber sections as described (Lin, 1988a). Briefly, 
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approximately 300 mg. center sections were cut. placed in 1.5 ml 
centrifuge tubes, and frozen on dry ice. The tissue was then 
dried to a stable weight in a Savant Speed-Vac Concentrator, and 
final dry weight was determined. Starch content was 
determined using approximately 60 mg. of dry material from 
each tuber. Soluble sugars were first removed by extracting 
three times with 1 ml of 80% ethanol at 70»C. for 20 minutes per 
treatment. After the final incubation, all remaining ethanol 
was removed by desiccation in a Speed Vac Concentrator. The 
solid material was resuspended in 400 ul 0.2 M potassium 
hydroxide, ground, and then incubated for 30 minutes at 100'C to 
soiubUize the starch. The solutions were cooled and neutralized 
by addition of 80 ul IN acetic acid. Starch was degraded to 
glucose by treatment with 14.8 units of pancreatic alpha-amylase 
15 (Sigma Chemical. St Louis) for 30 minutes at 37«C. followed by 
10 units of amyloglucosidase (Sigma Chemical. St Louis) for 60 
minutes at 55«C. Glucose released by the enzymatic digestions 
was measured using the Sigma Chemical (St. Louis) hexokinase 

lot- , 
20 Western blot and quantitative starch analyses were 

performed on center cuts from tubers generated under standard 

greenhouse conditions. Tubers from potato plants expressing E. 

coli ADPGPP contain on average 26.4% higher levels of startch 

than controls. The range of individual data points shows that 

25 two distinct populations exist with respect to starch content One 

population, represented by the control tubers, range m starch 

content from 10.2% up to 15%. with an average starch content of 

12.67%. The second population represents expressors of E. con 

ADPGPP, which range in starch content from 12.1% up to 

30 19 1%. with an average of 16%. The observed increase in starch 

content correlated with expression levels of E. coli ADPGPP, 
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demonstrating that this expression leads to an increase in 
starch content in potato tubers. 

Specific gravity was determined for the largest 2 or 3 
tubers from each of 36 independent transformants by the weight 
in air less weight in water method (Kleinkopf, 1987). The data 
show that tubers expressing E. coli ADPGPP had a significant 
increase in specific gravity compared to controls. On average, 
the specific gravity increased from 1.068 in control tubers up to 
1.088 in transgenic tubers (Table la), with the best lines 
averaging specific gravities of about 1.100. Specific gravity 
values varied among tubers of the same plant, as well as 
between tubers from different plants, as expected. However, only 
lines expressing E. coli ADPGPP produced tubers with elevated 
specific gravities, and these increases roughly correlated with 
the levels of glgC16 expression. Starch and dry matter content 
increased on average 35.0% and 23.9% respectively in tubers 
expressing E. coli ADPGPP, with the best lines containing 
approximately 59.3% and 40.6% increases, respectively. 

The starch content determined by the glucose method 
for a total of 26 potato lines was compared with the starch 
content calculated for these same tubers using specific gravity 
measurements. The levels of starch as calculated from specific 
gravity were in good agreement, with that determined directly 
(Table lb). For example, tubers expressing E. coli ADPGPP 
contained 16.01%. starch as determined by quantitative analysis 
versus 16.32% as determined by specific gravity. When 
increases in individual lines were examined, the experimentally 
determined starch content strongly correlated with the observed 
increase in dry matter (and expression of the glgC16 gene). 
Therefore, the observed increase in dry matter content in tubers 
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expressing E. coli ADPGPP is Urge* due to the increased 
deposition of starch. 

1>Me 1 

Average Average Average 

a) ***** *J*^ 

i *t ai\ 2L90 

£ ,oHADPGPP.(15) 1.088(0. 01 2) 
Controls <*» 

° The number of plants 

or three tubers per plant ^^g^Js). Percent 
deviation follows f eaf " ^^d from the average specific 

not express E. coh ADPGPP. 

Avg*Starch Avg*Starch 

„ ,~ 16.01(2.00) 
aeoliADPOPP^ll) 16.32(1-47 

23 Controls tt» "- 96(1 " 37) 

Ave ra g e values for percent ^*K^ffi5ffi 
enzymatic degradation to starch^conw deviations are 

ffS^T5==£& SOUSES* 
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ES2nmk t enzyme ADPGPP is encoded t, a 
IMtt Both E. co/i and plant ADPGPFs are subject to Ugh 
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regulation, with the bacterial enzyme being activated by fructose 
1,6-bisphosphate and inhibited by AMP (Preiss, 1984), while the 
plant enzymes are activated by 3-phosphoglycerate and inhibited 
by Pi (Copeland and Preiss, 1981; Preiss, 1984). Several mutants 
o(E.coli ADPGPP have been characterized and the kinetic 
properties of a few are summarized and compared in Table 2. 
(Romeo, T. and Preiss, J. T 1989). 



Table 2 



10 



15 



Strain 
wild type 
SG6 
CL1136 
61B 



Glycogen 

accumulation 

(mr/r cpMO 

20 

35 

74 

70 



Fructose 
1,6-bi phosphate 

68 
22 
5.2 
15 



AMP 

75 
170 
680 
860 



It has been demonstrated that expression of the glgC16 
variant, found in E. coli strain 618, leads to enhanced starch 
biosynthesis in plant cells. Expression of other bacterial 
ADPGPP enzymes in plant cells also enhance starch content 

Expression of the wild type glgC gene also leads to 
increased starch content. The wild type glgC gene, contained on 
an E. coli genomic clone designated pOP12 (Okita et al. t 1981) 
was isolated in a manner similar to that described for the 
isolation of the glgC16 gene described in Example 1. Briefly, an 
Ncol site was introduced at the 5* translational 6tart site and a 
Sad site was introduced just 3' of the termination codon by the 
PCR reaction using the QSP3 and QSM10 primers described in 
Example 1. The resultant Ncol-Sacl fragment was ligated into 
the vector pMON20102 idescribed in Example 1) previously 
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digested with Ncol and Sad. giving the plasmid pMON16937. 
The VSsu-glgC chimeric gene was constructed hy ligatmg an 
XHol^lll restriction fragment coning ^ 
(Timko et al.. 1985), the Bglll-Sacl fragment from pMON16937 
comprising the CTPV*C gene, and the plant transformaUon 
vector PMON977 digested with Xhol and Sad. to form 
pMON16938 (Figure 8). The pMON977 plasmid 
following well characterized DNA segments (Figure 9) First 
the 0 93 Kb fragment isolated from transposon Tn7 winch 
encodes bacterial spectinomycin/streptomycin resistance 
(Spc/Str). and is a determinant for selection in * coU and 
Atrium tumefaciens (Fling et al., 1985). Thas is jomed to 
the chimeric kanamycin resistance gene engineered for plant 
expression to allow selection of the transformed fcssue^ The 
chimeric gene consists of the 0.35 Kb cauliflower mosaad vxrus 
35S promoter (P-35S)(Odell et al.. 1985). the 0.83 Kb neomycn* 
phosphotransferase typell gene (NPTII), and the 0.26 Kb 3 - 
nontranslated region of the nopaline synthase gene (NOS 3 
^ley et al.. 1983). The next segment is the 0.75 
^Ucation rom the RK2 plasmid (ori-V) (Stalker et al.. .198U 
This is joined to the 33.1 Kb Sail to Pvul fragment from. pBR322 
which provides the origin of replication for maintenance ux E 
call (ori-322). and the bom site for the conjugational transfer urto 
tl bacterium tumefaciens cells. Next is the 0.36 Kb Pvulto 
Bdl fragment from the P TiT37 plasmid which con^ the 
nopaline-type T-DNA right border region (Fraley et a 1985^ 
The last segment is the expression cassette cons.ung.of the 0.65 
Kb cauliflower mosaic virus (CaMV) 35S promoter enhan d by 
duplication of the promoter sequence (P-E35S) (Kay « 
synthetic multilinker with several unique clonings, and the 
0 7 Kb 3" nontranslated region of the pea rbcS-E9 gene (E. 3 
(Coruzri et al.. 1984; Morelli et al.. 1985). The plasnud was 
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mated into Agrobacterium tumefaciens strain ABI, using the 
triparental mating system, and used to transform Lycopersicon 
esculentum cv. UC82B. 

Tomato plant cells are transformed utilizing the 
Agrobacterium strains described above generally by the method 
as described in McCormick et al. (1986). In particular, 
cotyledons are obtained from 7-8 day old seedlings. The seeds are 
surface sterilized for 20 minutes in 30% Clorox bleach and are 
germinated in Plantcons boxes on Davis germination media. 
Davis germination media is comprised of 4.3g/l MS salts, 20g/l 
sucrose and 10 mls/1 Nitsch vitamins, pH5.8. The Nitsch 
vitamin solution is comprised of 100mg/l myo-inositol, 5mg/l 
nicotinic acid, 0.5mg/l pyridoxine HC1, 0.5mg/l thiamine HC1, 
0.05mg/l folic acid, 0.05mg/l biotin, 2mg/l glycine. The seeds are 
allowed to germinate for 7-8 days in the growth chamber at 25°C # 
40% humidity under cool white lights with an intensity of 80 
einsteins m-2 8 -i. The photoperiod is 16 hours of light and 8 hours 
of dark. 

Once germination has occurred, the cotyledons are 
explanted using a #15 feather blade by cutting away the apical 
meristem and the . hypocotyl to create a rectangular explant. 
These cuts at the short ends of the germinating cotyledon 
increase the surface area for infection. The explants are bathed 
in sterile Davis regeneration liquid to prevent desiccation. Davis 
regeneration media is composed of IX MS salts, 3% sucrose, IX 
Nitsch vitamins, 2.0 mg/1 zeatin, pH 5.8. This solution is 
autoclaved with 0.8% Noble Agar. 

The cotyledons are pre-cultured on "feeder plates" 
composed of Calgene media containing no antibiotics. Calgene 
media is composed of 4.3g/l MS salts, 30g/l sucrose, 0.1g/l myo- 
inositol, 0.2g/l KH 2 P0 4 , 1.45mls/l of a 0.9mg/ml solution of 
thiamine HC1, 0.2mls of a 0.5mg/ml solution of kinetin and 
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„ . . , , .. -r o 4 D. this solution is adjusted to 
0.1ml of a 0.2mg/ml solution of 2,4 u, uu 

nH 6 0 with KOH. These plates are overlaid with 1.5-2.0 mis ot 
pH 6.0 with *u Whatman filter 

tobacco 2C0 05K media is composed 

which is soaked in 2CUU&K. mem M nnnY «toeW 

,x •-*,,,.«» i ml B5 vitamins (1000X stocic), 
of 4.3g/l Gibco MS salt mixture, 1ml do viia 

« i n PPPA from 2mg/ml stock, and lOui/i 
30e/l sucrose, 2mls/l PCPA irom w 

« e ; i c^rW The cotyledons are cultured for 1 

^ "ntSty of 40-50 einstein, with a continuous Ught 

Ph0tOPen ct»edo DS are then — d 

L concentration of the ^m&aCertam «» approxtmatel, 5x10 

The cotton, are ^^^^ 
, oltttio n for Mte""d are —1, 

^for * da,. After the two da,.. 

„ ee.ection ^.^^£^3^^ 

r »r 6 transferred to fresh Davis regeneration plates 

r^nV— n and kanamydn at the same levels. The 

^ent is scored for transformants at this time. The callus 
experiment is scored i inte rvals and any 

""aerate. Shoot, deveiop within 3, 
m0nth5 ' Once shoota develop, they areexcised cleanly *on> 

plates contain O.da mo . e rtri th - se i ec tion 

• TKese shoots form roots on tne seietuu 

500ug/ml carbenicilhn. lhese snoou, 
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media within two weeks. If no shoots appear after 2 weeks, 
shoots are trimmed and replanted on the selection media. Shoot 
cultures are incubated in percivals at a temperature of 22°C. 
Shoots with roots are then potted when roots are about 2cm in 
5 length. The plants are hardened off in a growth chamber at 

21°C with a photoperiod of 18 hours light and 6 hours dark for 2-3 
weeks prior to transfer to a greenhouse. In the greenhouse, the 
plants are grown at a temperature of 26°C during the day and 
21°C during the night. The photoperiod is 13 hours light and 11 
10 hours dark and allowed to mature. 

Transgenic tomato plants transformed with 
pMONl6938 were generated and screened by Western blot 
analysis for the glgC gene product. For Western blot analysis, 
proteins were extracted from leaf or stem tissue by grinding 1:1 
15 in 100 mM Tris pH7.5, 35 mM KC1, 5 mM dithiothreitol, 5 mM 
ascorbate, 1 mM EDTA, 1 mM benzamidine, and 20% glycerol. 
The protein concentration of the extract was determined using 
the Pierce BCA method, and proteins were separated on 3-17% 
SDS polyacrylamide gels. E. coli ADPGPP was detected using 
20 goat antibodies raised against purified E. coli ADPGPP and 
alkaline phosphatase conjugated rabbit anti-goat antibodies 
(Promega, Madison, WI). In most plants expressing wild type 
E. coli ADPGPP, levels of Kcoli ADPGPP were on 0.1% of the 
total extractable protein. For starch analysis, single leaf 
25 punches were harvested during late afternoon from 3-4 different, 
young, fully-expanded leaves per greenhouse grown plant. The 
leaf punches from each plant were combined and fresh weights 
were determined using a Mettler analytical balance. Total fresh 
weight per sample ranged from 60-80 mg. Soluble, sugars were 
30 first removed by extracting three times with 1 ml of 80% ethanol 
at 70°C for 20 minutes per treatment. After the final incubation, 
all remaining ethanol was removed by desiccation in a Speed 
Vac Concentrator. The solid material was resuspended in 400 pi 
0.2 M potassium hydroxide, ground, and then incubated for 30 
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nunutea at 100'C to solubilize the starch. The Bolutions were 
cooled and then neutralized by addition of 80 »1 IN aceUc aad 
Starch was degraded to glucose by treatment with 14.8 unit, of 
pancreatic alpha-amylase (Sigma Chemical. St. Louis) for 30 
nrinutes at 37°C. followed by 10 units of amyloglucosidase (Sigma 
Chemical. St. Louis) for 60 minutes at 55°C. Glucose released by 
the enzymatic digestions was measured using the Sigma 
Chemical (St. Louis) hexokinase kit. and these values were used 
to calculate starch content. 

Leaves from tomato plants expressing the glgC gene 
from the Ssu promoter contain on average 29% higher.levels of 
starch than controls, with the best line showing a 107% increase 
(Table 3). 
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Table 3 

Average Standard 

* s^ch Pnriatim 

E. coli ADPGPP+ (7) 4.54 2- 1 

Controls (8) 352 L9 

The number of lines screened are in parentheses. 
Thus, other ADPGPP's with different kinetic properties are also 
effective in increasing starch content in transgenic plants. It 
should be noted that high level expression of unregulated 
ADPGPP mutants in leaf tissue is undesiraable since it mil 
cause adverse effects on growth and development of the plants. 
In fact, use of the glgC16 gene in place of glgC in the above 
experiments did not result in regeneration of transformants 
expressing high levels of the glgC16 gene product. 

To express glgC from the patatin promoter, the same 
Bglll-Sacl CTP1-^C fragment from pMON16937 and a Hindis 
BamKl fragment containing the patatin promoter from the 
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plasmid pB 124 1.3 were ligated into the binary vector pMON 10098 
(Figure 11), digested with HindJU and Sad, to give the plasmid 
PMON16950 (Figure 10) The pBI241.3 plasmid contains the 
patatin-1 promoter segment comprising from the Accl site at 
1323 to the Dral site at 22389 [positions refer to the sequence in 
Bevan et al., 1986] with a Hindlll linker added at the latter 
position. The pMON10098 plasmid contains the following DNA 
regions, moving clockwise around Figure 11. 1) The chimeric 
kanamycin resistance gene engineered for plant expression to 
allow selection of the transformed, tissue. The chimeric gene 
consists of the 0.35 Kb cauliflower mosaic virus 35S promoter (P- 
35S) (Odell et al, 1985), the 0.83 Kb neomycin phosphotransferase 
typell gene (KAN), and the 0.26 Kb 3*-nontranslated region of the 
nopaline synthase gene (NOS 3') (Fraley et aL, 1983); 2) The 0.45 
Kb Clal to the Dral fragment from the pTi 15955 octopine Ti 
plasmid, which contains the T-DNA left border region (Barker et 
ah, 1983); 3) The 0.75 Kb segment containing the origin of 
replication from the RK2 plasmid (ori-V) (Stalker et al., 1981); 4) 
The 3.0 Kb Sail to PstI segment of pBR322 which provides the 
origin of replication for maintenance in E. coli (ori-322), and the 
bom site for the conjugations! transfer into the Agrobacteriurn 
tumefaciens cells; 5) The 0.93 Kb fragment isolated from 
transposon Tn7 which encodfes bacterial spectino- 
mycin/streptomycin resistance (Spc/Str) (Fling et al., 1985), and 
is a determinant for selection in E. coli and Agrobacteriurn 
tumefaciens; 6) The 0.36 Kb Pvul.to Bell fragment from the 
pTiT37 plasmid, which contains the nopaline-type T^DNA right 
border region (Fraley et al., 1985); and 7) The last segment is the 
expression cassette consisting of the 0.65 Kb cauliflower mosaic 
virus (CaMV) 35S promoter enhanced by duplication . of the 
promoter sequence -(P-E35S) (Kay et al., 1987), a synthetic 
multilinker with several unique, cloning sites, and the 0.7 Kb 3' 
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plasmid pBI241.3 were ligated into the binary vector pMON10098 
(Figure 11), digested with HindlU and Sad, to give the plasmid 
pMON 16950 (Figure 10) The pBI241.3 plasmid contains the 
patatin-1 promoter segment comprising from the Ace I site at 

5 1323 to the Dral site at 22389 [positions refer to the sequence in 

Bevan et al., 1986] with a Hindlll linker added at the latter 
position. The pMON10098 plasmid contains the following DNA 
regions, moving clockwise around Figure 11. 1) The chimeric 
kanamycin resistance gene engineered for plant expression to 

10 allow selection of the transformed tissue. The chimeric gene 
consists of the 0.35 Kb cauliflower mosaic virus 35S promoter (P- 
35S) (Odell et al. f 1985), the 0.83 Kb neomycin phosphotransferase 
typell gene (KAN), and the 0.26 Kb 3*-nontranslated region of the 
nopaline synthase gene (NOS 3') (Fraley et al., 1983); 2) The 0.45 

15 Kb Clal to the Dral fragment from the pTi 15955 octopine Ti 
plasmid, which contains the T-DNA left border region (Barker et 
al., 1983); 3) The 0.75 Kb segment containing the origin of 
replication from the RK2 plasmid (ori-V) (Stalker et al., 1981); 4) 
The 3.0 Kb Sail to PstI segment of pBR322 which provides the 

20 origin of replication for maintenance in E. coli (ori-322), and the 
bom site for the conjugational transfer into the Agrobacterium 
tumefaciens cells; 5) The 0.93 Kb fragment isolated from 
transposon Tn7 which encodfes bacterial spectino- 
mycin/streptomycin resistance (Spc/Str) (Fling et aL, 1985), and 

25 is a determinant for selection in E. coli and Agrobacterium 
tumefaciens; 6) The 0.36 Kb PvuLto Bell fragment from the 
pTiT37 plasmid, which contains the nopaline-type 1VDNA right 
border region (Fraley et al., 1985); and 7) The last segment is the 
expression cassette consisting of the 0.65 Kb cauliflower mosaic 

30 virus (CaMV) 35S promoter enhanced by duplication of the 
promoter sequence -(P-E35S) (Kay et al., 1987), a synthetic 
multilinker with several unique, cloning sites, and the 0.7 Kb 3* 
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nontranslaud region of the pea rbcS-E9 gene (E9 3') (Corn** et 
al 1984; Morelli et al., 1985). The plasmid was mated into 
Agrobacterium tumefaciens strain ABI, using the triparental 

mating system, and used to transform Russet Burbank line 
5 - Williams 82. Expression of glgC from the patatin promoter 

(pMON16950) in potato also results in enhanced starch content 

in tubers. 

In a manner similar to that described for the wild type 
glgC gene and for the glgC16 mutant gene, the mutant glgCSG5 
10 was also expressed in plants and results in an enhancement of 
starch content. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Kishore, Ganeah M . 

(ii) TITLE Of INVENTION: Increased Starch Content in Pla 

(iii) NUMBER OF SEQUENCES: 23 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Monsanto Co. 

(B> STREET: 700 Chesterfield Village Parkway 

(C) CITY: St. Louis 

(D) STATE: Missouri 

(E) COUNTRY: USA 

(F) ZIP: 63198 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

<vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 



(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: McBride, Thomas P. 

(B) REGISTRATION NUMBER: 32706 

(C) REFERENCE /DOCKET NUMBER: 38-21 (10530) A 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (314) 537-7357 

(B) TELEFAX: (314) 537-6047 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1296 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

<B) LOCATION: 1..1293 



<xi) SEQUENCE DESCRIPTION : SEQ ID NO:l: 

ATG GTT AGT TTA GAG AAG AAC GAT CAC TTA ATG TTG GCG CGC CAG CTG 4 8 

Met Val Ser Leu Glu Lys Asn Asp His Leu Met Leu Ala Arg Gin Leu 
1 5 10 15 

CCA TTG AAA TCT GTT GCC CTG ATA CTG GCG GGA GGA .CGT GGT ACC CGC 96 
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Pro Leu Lys Ser Val Ala Leu lie Leu Ala Gly Gly Arg ^ T 
20 25 30 

CTG AAG GAT TTA ACC AAT AAG CGA GCA AAA CCG GCC G7A CAC TTC GGC 14 4 

Leu Lys Asp Leu Thr Aan Lys Arg Ala Lys Pro Ala Val His Phe Glv 
35 40 45 

GOT AAG TTC CGC ATT ATC GAC TTT GCG CTG TCT AAC TGC ATC AAC TCC 192 
Gly Lys Phe Arg lie lie Asp Phe Ala Leu Ser Asn Cys lie Asn Ser 
50 55 60 

GGG ATC CGT CGT ATG GGC GTG ATC ACC CAG TAC CAG TCC CAC ACT CTG 24 0 

Gly lie Arg Arg Met Gly Val He Thr Gin Tyr Gin Ser His Thr Leu 
65 70 75 80 

GTG CAG CAC ATT CAG CGC GGC TGG TCA TTC TTC AAT GAA GAA ATG AAC 2 88 

Val Gin His lie Gin Arg Gly Trp Ser Phe Phe. Asn Glu Glu Met Asn 
85 90 95 

GAG TTT GTC GAT CTG CTG CCA GCA CAG CAG AGA ATG AAA GGG GAA AAC 33 6 

Glu Phe Val Asp Leu Leu Pro Ala Gin Gin Arg Met Lys Gly Glu Asn 
100 105 up 

TGG TAT CGC GGC ACC GCA GAT GCG GTC ACC CAA AAC CTC GAC ATT ATC 384 
Trp Tyr Arg Gly Thr Ala Asp Ala Val Thr Gin Asn Leu Asp He He 
:1S 120 . 125 

CGT CGT TAT AAA GCG GAA TAC GTG GTG ATC CTG GCG GGC GAC CAT ATC 4 32 

Arg Arg Tyr Lys Ala Glu Tyr Val Val He Leu Ala Gly Asp His He 
130 135 140 

TAC AAG CAA GAC TAC TCG CGT ATG CTT ATC GAT CAC GTC GAA AAA GGT 4 80 

Tyr Lys Gin Asp Tyr Ser Arg Met Leu He Asp His Val Glu Lys Glv 
145 150 155 160 

GTA CGT TGT ACC GTT GTT TGT ATG CCA GTA CCG ATT GAA GAA GCC TCC 528 
Val Arg Cys Thr Val Val Cys Met Pro Val Pro He Glu Glu Ala Ser 
165 170 175 

GCA TTT GGC GTT ATG GCG GTT GAT GAG AAC GAT AAA ACT ATC GAA TTC 576 
Ala Phe Gly Val Met Ala Val Asp Glu Asn Asp Lys Thr He Glu Phe 
180 185 190 

GTG GAA AAA CCT GCT AAC CCG CCG TCA ATG CCG AAC GAT CCG AGC AAA 624 
Val Glu Lys Pro Ala Asn Pro Pro Ser Met Pro Asn Asp Pro Ser Lys 
195 200 205 

TCT CTG GCG AGT ATG GGT ATC TAC GTC TTT GAC GCC GAC TAT CTG TAT 672 
Ser Leu Ala Ser Met Gly He Tyr Val Phe Asp Ala Asp Tyr Leu Tvr 
210 215 220 

GAA CTG CTG GAA GAA GAC GAT CGC GAT GAG AAC TCC AGC CAC GAC TTT 720 
Glu Leu Leu Glu Glu Asp Asp Arg Asp Glu Asn Ser Ser His Asp Phe 
225 230 235 240 

GGC AAA GAT TTG ATT CCC AAG ATC ACC GAA GCC GGT CTG GCC TAT GCG 7 68 

Gly Lys Asp Leu He Pro Lys He Thr Glu Ala Gly Leu Ala Tyr Ala 
245 250 255 

CAC CCG TTC CCG CTC TCT TGC GTA CAA TCC GAC CCG GAT GCC GAG CCG 816 
His Pro Phe Pro Leu Ser Cys Val Gin Ser Asp Pro Asp Ala Glu Pro 
260 265 270 

TAC TGG CGC GAT GTG GGT ACG CTG GAA GCT TAC TGG AAA GCG AAC CTC 8 64 
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Tyr Trp Arg Asp Val Gly Thr Leu Glu Ala Tyr Trp Lys Ala ; 
275 280 285 

GAT CTG GCC TCT GTG GTG CCG AAA CTG GAT ATG TAC GAT CGC AAT TGG 912 
Asp Leu Ala Ser Val Val Pro Lys Leu Asp Met Tyr Asp Arg Asn Trp 
290 295 300 

CCA ATT CGC ACC TAC AAT GAA TCA TTA CCG CCA GCG AAA TTC GTG CAG 960 
Pro lie Arg Thr Tyr Asn Glu Ser Leu Pro Pro Ala Lys Phe Val Gin 
305 310 315 320 

GAT CGC TCC GGT AGC CAC GGG ATG ACC CTT AAC TCA CTG GTT TCC GGC 1008 
Asp Arg Ser Gly Ser His Gly Met Thr Leu. Asn Ser Leu Val Ser Gly 
325 330 335 

GGT TGT GTG ATC TCC GGT TCG GTG GTG GTG CAG TCC GTT CTG TTC TCG 1056 
Gly Cys Val lie Ser Gly Ser Val Val Val Gin Ser Val Leu Phe Ser 
340 345 350 

CGC GTT CGC GTG AAT TCA TTC TGC AAC ATT GAT TCC GCC GTA TTG TTA 1104 
Arg Val Arg Val Asn Ser Phe Cys Asn lie Asp Ser Ala Val Leu Leu 
355 360 365 

CCG GAA GTA TGG GTA GGT CGC TCG TGC CGT CTG CGC CGC TGC GTC ATC 1152 
Pro Glu Val Trp Val Gly Arg Ser Cys Arg Leu Arg Arg Cys Val lie 
3*70 375 380 

GAT CGT GCT TGT GTT ATT CCG GAA GGC ATG GTG ATT GGT GAA AAC GCA 1200 
Asp Arg Ala Cys Val He Pro Glu Gly Met Val He Gly Glu Asn Ala 
385 390 395 400 

GAG GAA GAT GCA CGT CGT TTC TAT CGT TCA GAA GAA GGC ATC GTG CTG 12 48 

Glu Glu Asp Ala Arg Arg Phe Tyr Arg Ser Glu Glu Gly He Val Leu 
405 410 415 

GTA ACG CGC GAA ATG CTA CGG AAG TTA GGG CAT AAA CAG GAG CGA 1293 
Val Thr Arg Glu Met Leu Arg Lys Leu Gly His Lys Gin Glu Arg 
420 425 430 

TAA 1296 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 431 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Val Ser Leu Glu Lys Asn Asp His Leu Met Leu Ala Arg Gin Leu 
15 10 15 

Pro Leu Lys Ser Val Ala Leu He Leu Ala Gly Gly Arg Gly Thr Arg 
20 25 30 

Leu Lys Asp Leu Thr Asn Lys Arg Ala Lys Pro Ala Val His Phe Gly 
35 40 % 45 

Gly Lys Phe Arg He He Asp Phe Ala Leu Ser Asn Cys He Asn Ser 
50 55 60 
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Gly He Arg Arg Met Gly Val He Thr Gin Tyr Gin Ser His Tr 
65 70 75 BO 

Val Gin His He Gin Arg Gly Trp Ser Phe Phe Asn Glu Glu Met Asn 
85 90 95 

Glu Phe Val Asp Leu Leu Pro Ala Gin Gin Arg Met Lys Gly Glu Asn 
100 105 110 

Trp Tyr Arg Gly Thr Ala Asp Ala Val Thr Gin Asn Leu Asp He He 
115 120 125 

Arg Arg Tvr Lys Ala Glu Tyr Val Val He Leu Ala Gly Asp His He 
130 135 140 

Tyr Lys Gin Asp Tyr Ser Arg Met Leu He Asp His Val Glu Lys Gly 
145 150 155 160 

Val Arg Cys Thr Val Val Cys Met Pro Val Pro He Glu Glu Ala Ser 
165 170 175 

Ala Phe Giy Val Met Ala Val Asp Glu Asn Asp Lys Thr He Glu Phe 
180 185 190 

Val Glu Lvs Pro Ala Asn Pro Pro Ser Met Pro Asn Asp Pro Ser Lys 
195 200 205 

Ser Leu Ala Ser Met Gly He Tyr Val Phe Asp Ala Asp Tyr Leu Tyr 
210 215 * 220 

Glu Leu Leu Glu Glu Asp Asp Arg Asp Glu Asn Ser Ser His Asp Phe 
225 230 235 240 

Gly Lys Asp Leu He Pro Lys He Thr Glu Ala Gly Leu Ala Tyr Ala 
245 250 255 

His Pro Phe Pro Leu Ser Cys Val Gin Ser Asp Pro Asp Ala Glu Pro 
260 265 270 

Tyr Trp Arg Asp Val Gly Thr Leu Glu Ala Tyr Trp Lys Ala Asn Leu 
275 280 285 

Asp Leu Ala Ser Val Val Pro Lys Leu Asp Met Tyr Asp Arg Asn Trp 
290 295 300 

Pro He Arg Thr Tyr Asn Glu Ser Leu Pro Pro Ala Lys Phe Val Gin 
305 310 315 320 

Asp Arg Ser Gly Ser His Gly Met Thr Leu Asn Ser Leu Val Ser Gly 
325 330 335 

Gly Cys Val He Ser Gly Ser Val Val Val Gin Ser Val Leu Phe Ser 
340 345 350 

Arg Val Arg Val Asn Ser Phe Cys Asn He Asp Ser Ala Val Leu Leu 
355 360 365 

Pro Glu Val Trp Val Gly Arg Ser Cys Arg Leu Arg Arg Cys Val He 
370 375 380 

Asp Arg Ala Cys Val He Pro Glu Gly Met Val He Gly Glu Asn Ala 
385 390 " 395 400 
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Glu Glu Asp Ala Arg Arg Phe Tyr Arg Ser Glu Glu Gly lie V, 
405 410 4? 

Val Thr Arg Glu Met Leu Arg Lys Leu Gly His Lys Gin Glu Arg 
420 425 430 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 1296 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

UoO FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATION: 1..1293 

Ui) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATG GTT AGT TTA GAG AAG AAC GAT CAC TTA ATG TTG GCG CGC CAG CTG 4 8 

Met Val Ser Leu Glu Lys Asn Asp Hia Leu Met Leu Ala Arg Gin Leu 
1 5 10 15 

CCA TTG AAA TCT GTT GCC CTG ATA CTG GCG GGA GGA CGT GGT ACC CGC 96 
Pro Leu Lya Ser Val Ala Leu lie Leu Ala Gly Gly Arg Gly Thr Arg 
20 25 30 

CTG AAG GAT TTA ACC AAT AAG CGA GCA AAA CCG GCC GTA CAC TTC GGC 144 
Leu Lys Asp Leu Thr Asn Lys Arg Ala Lys Pro Ala Val His Phe Gly 
35 40 45 

GGT AAG TTC CGC ATT ATC GAC TTT GCG CTG TCT AAC TGC ATC AAC TCC 192 
Gly Lys Phe Arg He He Asp Phe Ala Leu Ser Aan Cya He Aan Ser 
50 55 60 

GGG ATC CGT CGT ATG GGC GTG ATC ACC CAG TAC CAG TCC CAC ACT CTG 240 
Gly He Arg Arg Met Gly Val He Thr Gin Tyr Gin Ser His Thr Leu 
65 70 75 80 

GTG CAG CAC ATT CAG CGC GGC TGG TCA TTC TTC AAT GAA GAA ATG AAC 288 
Val Gin His He Gin Arg Gly Trp Ser Phe Phe Aan Glu Glu Met Aan 
85 90 95 

GAG TTT GTC GAT CTG CTG CCA GCA CAG CAG AGA ATG AAA GGG GAA AAC 33 6 

Glu Phe Val Aap Leu Leu Pro Ala Gin Gin Arg Met Lys Gly Glu Asn 
100 105 110 

TGG TAT CGC GGC ACC GCA GAT GCG GTC ACC CAA AAC CTC GAC ATT ATC 38 4 

Trp Tyr Arg Gly Thr Ala Asp Ala Val Thr Gin Asn Leu Asp He He 
115 120 125 

CGT CGT TAT AAA GCG GAA TAC GTG GTG ATC CTG GCG GGC GAC CAT ATC 432 
Arg Arg Tyr Lya Ala Glu Tyr Val Val He Leu Ala Gly Asp His He 
130 135 140 

TAC AAG CAA GAC TAC TCG CGT ATG CTT ATC GAT CAC GTC GAA AAA GGT 4 80 

Tyr Lys Gin Asp Tyr Ser Arg Met Leu He Asp His Val Glu Lys Gly 
145 150 - 155 160 
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GTA CGT TGT ACC GTT GTT TGT ATG CCA GTA CCG ATT GAA GAA G 

Val Arg Cys Thr Val Val Cys Met Pro Val Pro lie Glu Glu A 

165 170 175 

GCA TTT GGC GTT ATG GCG GTT GAT GAG AAC GAT AAA ACT ATC GAA TTC 576 

Ala Phe Gly Val Mec Ala Val Asp Glu Asn Asp Lys Thr lie Glu Phe 

180 185 190 

GTG GAA AAA CCT GCT AAC CCG CCG TCA ATG CCG AAC GAT CCG AGC AAA 624 

Val Glu Lys Pro Ala Asn Pro Pro Ser Met Pro Asn A3p Pro Ser Lys 

195 200 205 

TCT CTG GCG AGT ATG GGT ATC TAC GTC TTT GAC GCC GAC TAT CTG TAT 672 

Ser Leu Ala Ser Met Gly lie Tyr Val Phe Asp Ala Asp Tyr Leu Tyr 

210 215 220 

GAA CTG CTG GAA GAA GAC GAT CGC GAT GAG AAC TCC AGC CAC GAC TTT 720 

Glu Leu Leu Glu Glu Asp Asp Arg Asp Glu Asn Ser Ser His Asp Phe 

225 230 235 240 

GGC AAA GAT TTG ATT CCC AAG ATC ACC GAA GCC GGT CTG GCC TAT GCG 768 

Gly Lys Asp Leu lie Pro Lya lie Thr Glu Ala Gly Leu Ala Tyr Ala 

245 250 255 

CAC CCG TTC CCG CTC TCT TGC GTA CAA TCC GAC CCG GAT GCC GAG CCG * 816 

His Pro Phe Pro Leu Ser Cys Val Gin Ser Asp Pro Asp Ala Glu Pro 

260 265 270 

TAC TGG CGC GAT GTG GGT ACG CTG GAA GCT TAC TGG AAA GCG AAC CTC 8 64 

Tyr Trp Arg Asp Val Gly Thr Leu Glu Ala Tyr Trp Lya Ala Asn Leu 

275 280 285 

GAT CTG GCC TCT GTG GTG CCG GAA CTG GAT ATG TAC GAT CGC AAT TGG 912. 

Asp Leu Ala Ser Val Val Pro Glu Leu Asp Met Tyr Asp Arg Asn Trp 

290 295 300 

CCA ATT CGC ACC TAC AAT GAA TCA TTA CCG CCA GCG AAA TTC GTG CAG 960 

Pro lie Arg Thr Tyr Asn Glu Ser Leu Pro Pro Ala Lys Phe Val Gin 

305 310 315 320 

GAT CGC TCC GGT AGC CAC GGG ATG ACC CTT AAC TCA CTG GTT TCC GAC 1008 

Asp Arg Ser Gly Ser His Gly Met Thr Leu Asn Ser Leu Val Ser Asp 

325 330 335 

GGT TGT GTG ATC TCC GGT TCG GTG GTG GTG CAG TCC GTT CTG TTC TCG 1056 

Gly Cys Val He Ser Gly Ser Val Val Val Gin Ser Val Leu Phe Ser 

340 345 350 

CGC GTT CGC GTG AAT TCA TTC TGC AAC ATT GAT TCC GCC GTA TTG TTA 1104 

Arg Val Arg Val Aan Ser Phe Cys Asn He Asp Ser Ala Val Leu Leu 

355 360 365 

CCG- GAA GTA TGG GTA GGT CGC TCG TGC CGT CTG CGC CGC TGC GTC ATC 1152 

Pro Glu Val Trp Val Gly Arg Ser Cys Arg Leu Arg Arg Cys Val He 

370 375 380 

GAT CGT GCT TGT GTT ATT CCG GAA GGC ATG GTG ATT GGT GAA AAC GCA 1200 

Asp Arg Ala Cys Val He Pro Glu Gly Met Val He Gly Glu Asn Ala 

385 390 395 400 

GAG GAA GAT GCA CGT CGT TTC TAT CGT TCA GAA GAA GGC ATC GTG CTG 1248 

Glu Glu Asp Ala Arg Arg Phe Tyr Arg Ser Glu Glu Gly He Val Leu 

405 410 415 
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GTA ACG CGC GAA ATG CTA CGG AAG TTA GGG CAT AAA CAG GAG s 

Val Thr Arg Glu Met Leu Arg Lys Leu Gly His Lys Gin Glu ^ 

420 425 430 

TAA 1296 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 431 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Val Ser Leu Glu Lys Asn Asp His Leu Met Leu Ala Arg Gin Leu 
1 5 10 15 

Pro Leu Lys Ser Val Ala Leu lie Leu Ala Gly Gly Arg Gly Thr Arg 
20 25 30 

Leu Lys Asp Leu Thr Asn Lys Arg Ala Lys Pro Ala Val His Phe Gly 
35 40 45 

Gly Lys Phe Arg lie lie Asp Phe Ala Leu Ser Asn Cys He Asn Ser 
50 55 60 

Gly He Arg Arg Met Gly Val He Thr Gin Tyr Gin Ser His Thr Leu 
65 70 75 80 

Val Gin His He Gin Arg Gly Trp Ser Phe Phe Asn Glu Glu Met Asn 
85 90 95 

Glu Phe Val Asp Leu Leu Pro Ala Gin Gin Arg Met Lys Gly Glu Asn 
100 105 110 

Trp Tyr Arg Gly Thr Ala Asp Ala Val Thr Gin Asn Leu Asp lie He 
115 120 125 

Arg Arg Tyr Lys Ala Glu Tyr Val Val He Leu Ala Gly Asp His He 
130 135 140 

Tyr Lys Gin Asp Tyr Ser Arg Met Leu He Asp His Val Glu Lys Gly 
145 150 155 160 

Val Arg Cys Thr Val Val Cys Met Pro Val Pro He Glu Glu Ala Ser 
165 170 175 

Ala Phe Gly Val Met Ala Val Asp Glu Asn Asp Lys Thr He Glu Phe 
130 185 190 

Val Glu Lys Pro Ala Asn Pro Pro Ser Met Pro Asn Asp Pro Ser Lys 
195 200 205 

Ser Leu Ala Ser Met Gly He Tyr Val Phe Asp Ala Asp Tyr Leu Tyr 
210 215 220 

Glu Leu Leu Glu Glu Asp Asp Arg Aso Glu Asn Ser Ser His Asp Phe 
225 230 " 235 240 

Gly Lys Asp Leu He Pro Lys He Thr Glu Ala Gly Leu Ala Tyr Ala 
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245 250 

His Pro Phe Pro Leu Ser Cys Val Gin Ser Asp Pro Asp Ala Glu Pro 
260 2&S 270 

Tyr Trp Arg Asp Val Gly Thr Leu Glu Ala Tyr Trp Lys Ala Asn Leu 
275 280 235 

Asp Leu Ala Ser Val Val Pro Glu Leu Asp Met Tyr Asp Arg Asn Trp 
290 295 300 

Pro lie Arg Thr Tyr Asn Glu Ser Leu Pro Pro Ala Ly3 Phe Val Gin 
305 310 315 320 

Asp Arg Ser Gly Ser His Gly Met Thr Leu Asn Ser Leu Val Ser Asp 
325 330 335 

Gly Cys Val lie Ser Gly Ser Val Val Val Gin Ser Val Leu Phe Ser 
340 345 350 

Arg Val Arg Val Asn Ser Phe Cys Asn lie Asp Ser Ala Val Leu Leu 
355 360 365 

Pro Glu Val Trp Val Gly Arg Ser Cys Arg Leu Arg Arg Cys Val lie 
370 375 380 

Asp Arg Ala Cys Val He Pro Glu Gly Met Val He Gly Glu Asn Ala 
385 390 395 400 

Glu Glu Asp Ala Arg Arg Phe Tyr Arg Ser Glu Glu Gly He Val Leu 
405 410 415 

Val Thr Arg Glu Met Leu Arg Lys Leu Gly His Lys Gin Glu Arg 
420 425 430 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 355 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 88.. 354 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

AAGCTTGTTC TCATTGTTGT TATCATTATA TATAGATGAC CAAAGCACTA GACCAAACCT 60 

CAGTCACACA AAGAGTAAAG AAGAACA ATG . GCT TCC TCT ATG CTC TCT TCC HI 

Met Ala Ser Ser Met Leu Ser Ser 
1 5 

GCT ACT ATG GTT GCC TCT CCG GCT CAG GCC ACT ATG GTC GCT CCT TTC 159 
Ala Thr Met Val Ala Ser Pro Ala Gin Ala Thr Met Val Ala Pro Phe 
10 15 - 20 

AAC GGA CTT AAG TCC TCC GCT GCC TTC CCA GCC ACC CGC AAG GCT AAC 207 
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Aan Gly Leu Lya Ser Ser Ala Ala Phe Pro Ala Thr Arq Lvs 
25 30 35 

AAC GAC ATT ACT TCC ATC ACA AGC AAC GGC GGA AGA GTT AAC TGC ATG 25 S 

Aan Aap lie Thr Ser He Thr Ser Aan Gly Gly Arg Val Aan Cy 3 Met 
45 50 55 

CAG GTG TGG CCT CCG ATT GGA AAG AAG AAG TTT GAG ACT CTC TCT TAC 
Gin Val ?rp Pro Pro He Gly Lya Ly 3 hys Phe Glu Thr Leu Ser Tvr 
60 65 70 

CTT CCT GAC CTT ACC GAT TCC GGT GGT CGC GTC AAC TGC ATG CAG GCC 
Leu Pro Aap Leu Thr Asp Ser Gly Gly Arg. Val Aan Cya Met Gin Ala 
75 80 85 



ATG G 
Met 



(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 8 9 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Ala Ser Ser Met Leu Ser Ser Ala Thr Met Val Ala Ser Pro Ala 
15 10 15 

; Gin Ala Thr Met Val Ala Pro Phe Aan Gly Leu Lys Ser Ser Ala Ala 
20 25 30 

; ?he Pro Ala Thr Arg Ly 3 Ala Aan Aan Aap He Thr Ser He Thr Se- 
| 35 40 45 

; Aan Gly Gly Arg Val Aan Cya Met Gin Val Trp Pro Pro He Gly Lya 

i 55 60 

i 

Lya Lya Phe Glu Thr Leu Ser Tyr Leu Pro Asp Leu Thr Asp Ser Gly 



75 80 



Gly Arg Val Aan Cya Met Gin Ala Met 
85 

(2) INFORMATION FOR SEQ ID NO:7: ' 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1575 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



303 



351 



355 



Ux> FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 3.. 1565 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CC ATG GCG GCT TCC ATT GGA GCC TTA AAA TCT TCA CC7 TCT TCT AAC 47 
Met Ala Ala Ser He Gly Ala Leu Lys Ser Ser Pro Ser Ser Asn 
1 5 10 15 

AAT TGC ATC AAT GAG AG A AGA AAT GAT TCT ACA CGT GCT GTA TCC AGC 95 
Asn Cys He Asn Glu Acg Arg Asn Asp Ser Thr Arg Ala Val Ser Ser 
20 25 30 

AGA AAT CTC TCA TTT TCG TCT TCT CAT CTC GCC GGA GAC AAG TTG ATG 143 
Arg Asn Leu Ser Phe Ser Ser Ser His Leu Ala Gly Asp Lys Leu Met 
35 40 45 

CCT GTA TCG TCC TTA CGT TCC CAA GGA GTC CGA TTC AAT GTG AGA AGA 191 
Pro Val Ser Ser Leu Arg Ser Gin Gly Val Arg Phe Asn Val Arg Arg 
50 55 60 

AGT CCA ATG ATT GTG TCG CCA AAG GCT GTT TCT GAT TCG CAG AAT TCA 239 
Ser Pro Met He Val Ser Pro Lys Ala Val Ser Asp Ser Gin Asn Ser 
65 70 75 

CAG ACA TGT CTA GAC CCA GAT GCT AGC CGG AGT GTT TTG GGA ATT ATT 287 
Gin Thr Cys Leu Asp Pro Asp Ala Ser Arg Ser Val Leu Gly He He 
80 85 90 95 

CTT GGA GGT GGA GCT GGG ACC CGA CTT TAT CCT CTA ACT AAA AAA AGA 335. 
Leu Gly Gly Gly Ala Gly Thr Arg Leu Tyr Pro Leu Thr Lys Lys Arg 
100 105 110 

GCA AAG CCA GCT GTT CCA CTT GGA GCA AAT TAT CGT CTG ATT GAC ATT 383 
Ala Lys Pro Ala Val Pro Leu Gly Ala Asn Tyr Arg Leu He Asp He 
115 120 125 

CCT GTA AGC AAC TGC TTG AAC AGT AAT ATA TCC AAG ATT TAT GTT CTC 431 
Pro Val Ser Asn Cys Leu Asn Ser Asn He Ser Lys He Tyr Val Leu 
130 135 140 

ACA CAA TTC AAC TCT GCC TCT CTG AAT CGC CAC CTT TCA CGA GCA TAT 47 9 

Thr Gin Phe Asn Ser Ala Ser Leu Asn Arg His Leu Ser Arg Ala Tyr 
145 150 155 

GCT AGC AAC ATG GGA GGA TAC AAA AAC GAG GGC TTT GTG GAA GTT CTT 527 
Ala Ser Asn Met Gly Gly Tyr Lys Asn Glu Gly Phe Val Glu Val Leu 
160 165 170 175 

GCT GCT CAA CAA AGT CCA GAG AAC CCC GAT TGG TTC CAG GGC ACG GCT 57 5 

Ala Ala Gin Gin Ser Pro Glu Asn Pro Asp Trp Phe Gin Gly Thr Ala 
180 185 190 

GAT GCT GTC AGA CAA TAT CTG TGG TTG TTT GAG GAG CAT ACT GTT CTT . 623 

Asp Ala Val Arg Gin Tyr Leu Trp Leu Phe Glu Glu His Thr Val Leu 
195 200 205 

GAA TAC CTT ATA CTT GCT GGA GAT CAT CTG TAT CGA ATG GAT TAT GAA 671 
Glu Tyr Leu He Leu Ala Gly Asp His Leu Tyr Arg Met Asp Tyr Glu 
210 215 220 

AAG TTT ATT CAA GCC CAC AGA, GAA ACA GAT GCT GAT ATT ACC GTT GCC 719 
Lys Phe He Gin Ala His Arg Glu Thr Asp Ala Asp He Thr Val Ala 
225 . 230 235 

GCA CTG CCA ATG GAC GAG AAG CGT GCC ACT GCA TTC GGT CTC ATG AAG 7 67 
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Ala Leu Pro Met Asp Glu Lys Arg Ala Thr Ala Phe Gly Leu 
240 245 250 

ATT GAC GAA GAA GGA CGC ATT ATT GAA TTT GCA GAG AAA CCG CAA GGA 815 
lie Asp Glu Glu Gly Arg lie He Glu Phe Ala Glu Lys Pro Gin Gly 
260 265 270 

GAG CAA TTG CAA GCA ATG AAA GTG GAT ACT ACC ATT TTA GGT CTT GAT 8 63 

Glu Gin Leu Gin Ala Met Lys Val Asp Thr Thr lie Leu Gly Leu Asp 
275 280 285 

GAC AAG AGA GCT AAA GAA ATG CCT TTC ATT GCC AGT ATG GGT ATA TAT 911 
Asd Lys Arg Ala Lys Glu Met Pro Phe He Ala Ser Met Gly He Tyr 
290 295 300 

GTC ATT AGC AAA GAC GTG ATG TTA AAC CTA CTT CGT GAC AAG TTC CCT 959 
Val He Ser Lys Asp Val Met Leu Asn Leu Leu Arg Asp Lys Phe Pro 
305 310 315 

GGG GCC AAT GAT TTT GGT AGT GAA GTT ATT CCT GGT GCA ACT TCA CTT 1007 
Gly Ala Asn Asp Phe Gly Ser Glu Val lie Pro Gly Ala Thr Ser Leu 
320 325 330 335 

GGG ATG AGA GTG CAA GCT TAT TTA TAT GAT GGG TAC TGG GAA GAT ATT 1055 
Gly Met Arg Val Gin Ala Tyr Leu Tyr Asp Gly Tyr Trp Glu Asp He 
340 345 350 

GGT ACC ATT GAA GCT TTC TAC AAT GCC AAT TTG GGC ATT ACA AAA AAG 1103 
Gly Thr He Glu Ala Phe Tyr Asn Ala Asn Leu Gly He Thr Lys Lys 
355 360 365 

CCG GTG CCA GAT TTT AGC TTT TAC GAC CGA TCA GCC CCA ATC TAC ACC 1151 
Pro Val Pro Asp Phe Ser Phe Tyr Asp Arg Ser Ala Pro He Tyr Thr 
370 375 380 

CAA CCT CGA TAT CTA CCA CCA TCA AAA ATG CTT GAT GCT GAT GTC ACA 1199 
Gin Pro Arg Tyr Leu Pro Pro Ser Lys Met Leu Asp Ala Asp Val Thr 
385 390 395 

GAT AGT GTC ATT GGT GAA GGT TGT GTG ATC AAG AAC TGT AAG ATT CAT 1247 
Asp Ser Val He Gly Glu Gly Cys Val He Lys Asn Cys Lys He His 
400 405 410 415 

CAT TCC GTG GTT GGA CTC AGA TCA TGC ATA TCA GAG GGA GCA ATT ATA 12 95 

His Ser Val Val Gly Leu Arg Ser Cys He Ser Glu Gly Ala He He 
420 425 430 

GAA GAC TCA CTT TTG ATG GGG GCA GAT TAC TAT GAG ACT GAT GCT GAC 134 3 

Glu Asp Ser Leu Leu Met Gly Ala Asp Tyr Tyr Glu Thr Asp. Ala Asp 
435 440 445 

AGG AAG TTG CTG GCT GCA AAG GGC AGT GTC CCA ATT GGC ATC GGC AAG 1391 
Arg Lys Leu Leu Ala Ala Lys Gly Ser Val Pro He Gly He Gly Lys 
450 455 460 

AAT TGT CAC ATT AAA AGA GCC ATT ATC GAC AAG AAT GCC CGT ATA GGG 1439 
Asn Cys His He Lys Arg Ala He He Asp Lys Asn Ala Arg He Gly 
465 470 475 

GAC AAT GTG AAG ATC ATT AAC AAA GAC AAC GTT CAA GAA GCG GCT AGG 14 87 

Asp Asn Val Lys He He Asn Lys Asp Asn Val Gin Glu Ala Ala Arg 
480 485 * _ 490 495 

GAA ACA GAT GGA TAC TTC ATC AAG AGT GGG ATT GTC ACC GTC ATC AAG 1535 
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Glu Thr Asp Gly Tyr Phe He Lys Ser Gly He Val Thr Val l 
500 505 f 

GAT GCT TTG ATT CCA AGT GGA ATC ATC ATC TGA7GAGCTC 1575 
Asp Ala Leu He Pro Ser Gly He He He 
515 520 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 521 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Ala Ala Ser He Gly Ala Leu Lys Ser Ser Pro Ser Ser Asn Asn 
15 10 15 

Cys He Asn Glu Arg Arg Asn Asp Ser Thr Arg Ala Val Ser Ser Arg 
20 25 30 

Asn Leu Ser Phe Ser Ser Ser His Leu Ala Gly Asp Lys Leu Met Pro 
35 40 45 

Val Ser Ser Leu Arg Ser Gin Gly Val Arg Phe Asn Val Arg Arg Ser 
50 55 60 

Pro Met He Val Ser Pro Lys Ala Val Ser Asp Ser Gin Asn Ser Gin 
65 70 75 80 

Thr Cys Leu Asp Pro Asp Ala Ser Arg Ser Val Leu Gly He He Leu 
85 90 95 

Gly Gly Gly Ala Gly Thr Arg Leu Tyr Pro Leu Thr Lys Lys Arg Ala 
100 105 HO 

Lys Pro Ala Val Pro Leu Gly Ala Asn Tyr Arg Leu He Asp He Pro 
115 120 125 

Val Ser Asn Cys Leu Asn Ser Asn He Ser Lys He Tyr Val Leu Thr 
130 135 140 

Gin Phe Asn Ser Ala Ser Leu Asn Arg His Leu Ser Arg Ala Tyr Ala 
145 150 155 160 

Ser Asn Met Gly Gly Tyr Lys Asn Glu Gly Phe Val Glu Val Leu Ala 
165 170 175 

Ala Gin Gin Ser Pro Glu Asn Pro Asp Trp Phe Gin Gly Thr Ala Asp 
180 185 190 

Ala Val Arg Gin Tyr Leu Trp Leu Phe Glu Glu His Thr Val Leu Glu 
195 200 205 

Tyr Leu He Leu Ala Gly Asp His Leu Tyr Arg Met Asp Tyr Glu Lys 
210 215 220 

Phe He Gin Ala His Arg Glu Thr Asp itla Asp He Thr Val Ala Ala 
225 230 235 240 
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Leu Pro Met Asp Glu Lys Arg Ala Thr Ala Phe Gly Leu Met L* 
245 250 25 

Asp Glu Glu Gly Arg He He Glu Phe Ala Glu Lys Pro Gin Gly Glu 
260 265 270 

Gin Leu Gin Ala Met Lys Val Asp Thr Thr He Leu Gly Leu Asp Asp 
275 280 285 

Lys Arg Ala Lya Glu Met Pro Phe He Ala Ser Met Gly He Tyr Val 
290 295 300 

He Ser Lys Asp Val Met Leu Asn Leu Leu Arg Asp Lya Phe Pro Gly 
305 310 315 320 

Ala Asn Asp Phe Gly Ser Glu Val He Pro Gly Ala Thr Ser Leu Gly 
325 330 335 

Met Arg Val Gin Ala Tyr Leu Tyr Asp Gly Tyr Trp Glu Asp He Gly 
340 345 350 

Thr He Glu Ala Phe Tyr Asn Ala Asn Leu Gly He Thr Lys Lys Pro 
355 360 365 

Val Pro Asp Phe Ser Phe Tyr Asp Arg Ser Ala Pro He Tyr Thr Gin 
370 375 380 

Pro Arg Tyr Leu Pro Pro Ser Lys Met Leu Asp Ala Asp Val Thr Asp 
385 390 395 400 

Ser Val He Gly Glu Gly Cys Val He Lys Asn Cys Lys He His His 
405 410 415 

Ser Val Val Gly Leu Arg Ser Cys He Ser Glu Gly Ala He He Glu 
420 425 430 

Asp Ser Leu Leu Met Gly Ala Asp Tyr Tyr Glu Thr Asp Ala Asp Arg 
435 440 445 

i Lys Leu Leu Ala Ala. Lys Gly Ser Val Pro He Gly He Gly Lys Asn 
t 450 455 460 

Cys His He Lys Arg Ala He He Asp Lys Asn Ala Arg He Gly Asp 
465 470 475 480 

Asn Val Lys He He Asn Lys Asp Asn Val Gin Glu Ala Ala Arg Glu 
485 490 495 

Thr Asp Gly Tyr Phe lie Lys Ser Gly He Val Thr Val He Lys Asp 
500. 505 510 

Ala Leu He Pro Ser Gly He He He 
515 520 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A> LENGTH: 1519 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1410 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

AAC AAG ATC AAA CCT GGG GTT GCT TAC TCT GTG ATC ACT ACT GAA AAT 48 
Asn Lva He Lys Pro Gly. Val Ala Tyr Ser Val He Thr Thr Glu Asn 
15 10 15 

GAC ACA CAG ACT GTG TTC GTA GAT ATG CCA CGT CTT GAG AGA CGC CGG 96 
Asp Thr Gin Thr Val Phe Val Asp Mec Pro Arg Leu Glu Arg Arg Arg 
20 25 30 

GCA AAT CCA AAG GAT GTG GCT GCA GTC ATA CTG GGA GGA GGA GAA GGG 144 
Ala Asn Pro Lys Asp Val Ala Ala Val He Leu Gly Gly Gly Glu Gly 
35 40 45 

ACC AAG TTA TTC CCA CTT ACA AGT AGA ACT GCA ACC CCT GCT GTT.CCG 192 
Thr Lys Leu Phe Pro Leu Thr Ser Arg Thr Ala Thr Pro Ala Val Pro 
50 55 60 

GTT GGA GGA TGC TAC AGG CTA ATA GAC ATC CCA ATG AGC AAC TGT ATC * 240 

Val Gly Gly Cys Tyr Arg Leu He Asp He Pro Met Ser Asn Cys He 
65 70 75 80 

AAC AGT GCT ATT AAC AAG ATT TTT GTG CTG ACA CAG TAC AAT TCT GCT 288 
Asn Ser Ala He Asn Lys He Phe Val Leu Thr Gin Tyr Asn Ser Ala 
85 90 95 

CCC CTG AAT CGT CAC ATT GCT CGA ACA TAT TTT GGC AAT GGT GTG AGC 336 
Pro Leu Asn Arg His He Ala Arg Thr Tyr Phe Gly Asn Gly Val Ser 
100 105 110 

TTT GGA GAT GGA TTT GTC GAG GTA CTA GCT GCA ACT CAG ACA CCC GGG 384 
Phe Gly Asp Gly Phe Val Glu Val Leu Ala Ala Thr Gin Thr Pro Gly 
115 120 125 

GAA GCA GGA AAA AAA TGG TTT CAA GGA ACA GCA GAT GCT GTT AGA AAA 432 
Glu Ala Gly Lys Lys Trp Phe Gin Gly Thr Ala Asp Ala Val Arg Lys 
130 135 140 

TTT ATA TGG GTT TTT GAG GAC GCT AAG AAC AAG AAT ATT GAA AAT ATC 4 80 

Phe He Trp Val Phe Glu Asp Ala Lys Asn Lys Asn He Glu Asn He 
145 150 155 160 

GTT GTA CTA TCT GGG GAT CAT CTT TAT AGG ATG GAT TAT ATG GAG TTG 528 
Val Val Leu Ser Gly Asp His Leu Tyr Arg Met Asp Tyr Met Glu Leu 
165 .170 175 

GTG CAG AAC CAT ATT GAC AGG AAT GCT GAT ATT ACT CTT TCA TGT GCA 576 
Val Gin Asn His He Asp Arg Asn Ala Asp He Thr Leu Ser Cys Ala 
180 185 190 

CCA GCT GAG GAC AGC CGA GCA TCA GAT TTT GGG CTG GTC AAG ATT GAC 624 
Pro Ala Glu Asp Ser Arg Ala Ser Asp Phe Gly Leu Val Lys He Asp 
195 200 205 

" AGC AGA GGC AGA GTA GTC CAG TTT GCT GAA AAA CCA AAA GGT TTT GAT 6"* 2 

Ser Arg Gly Arg Val Val Gin Phe Ala Glu Lys Pro Lys Gly Phe Asp 
210 215 . 220 
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CTT AAA GCA ATG CAA GTA GAT ACT ACT CTT GTT GGA TTA TCT C 
Leu Lys Ala Met Gin Val Asp Thr Thr Leu Val Gly Leu Ser r 
225 230 235 240 

GAT GCG AAG AAA TCC CCC TAT ATT GCT TCA ATG GGA GTT TAT GTA TTC 7 68 

Asp Ala Lys Lys Ser Pro Tyr He Ala Ser Met Gly Val Tyr Val Phe 
245 250 25S 

AAG ACA GAT GTA TTG TTG AAG CTC TTG AAA TGG AGC TAT CCC ACT TCT 816 
Lys Thr Asp Val Leu Leu Lys Leu Leu Lys Tip Ser Tyr Pro Thr Ser 
260 265 270 

AAT GAT TTT GGC TCT GAA ATT ATA CCA GCA" GCT ATT GAC GAT TAC AAT 8 64 

Asn Asp Phe Gly Ser Glu He He Pro Ala Ala He Asp Asp Tyr Asn 
275 280 285 

GTC CAA GCA TAC ATT TTC AAA GAC TAT TGG GAA GAC ATT GGA ACA ATT 912 
Val Gin Ala Tyr He Phe Lys Asp Tyr Trp Glu Asp He Gly Thr He 
290 295 300 

AAA TCG TTT TAT AAT GCT AGC TTG GCA CTC ACA CAA GAG TTT CCA GAG 9 60 

Lys Ser Phe Tyr Asn Ala Ser Leu Ala Leu Thr Gin Glu Phe Pro Glu 
305 310 315 320 

TTC CAA TTT TAC GAT CCA AAA ACA CCT TTT TAC ACA TCT CCT AGG TTC 1008 
Phe Gin Phe Tyr Asp Pro Lys Thr Pro Phe Tyr Thr Ser Pro Arg Phe 
325 330 335 

CTT CCA CCA ACC AAG ATA GAC AAT TGC AAG ATT AAG GAT GCC ATA ATC 1056 
Leu Pro Pro Thr Lys He Asp Asn Cys Lys He Lys Asp Ala He He 
340 345 350 

TCT CAT GGA TGT TTC TTG CGA GAT TGT TCT GTG GAA CAC TCC ATA GTG 1104 
Ser His Gly Cys Phe Leu Arg Asp Cys Ser Val Glu His Ser He Val 
355 360 365 

GGT GAA AG A TCG CGC TTA GAT TGT GGT GTT GAA CTG AAG GAT ACT TTC 1152 
Gly Glu Arg Ser Arg Leu Asp Cys Gly Val Glu Leu Lys Asp Thr Phe' 
370 375 380 

ATG ATG GGA GCA GAC TAC TAC CAA ACA GAA TCT GAG ATT GCC TCC CTG 1200 
Met Met Gly Ala Asp Tyr Tyr Gin Thr Glu Ser Glu He Ala Ser Leu 
385 390 395 400 

TTA GCA GAG GGG AAA GTA CCG ATT GGA ATT GGG GAA AAT ACA AAA ATA 12 4 8 

Leu Ala Glu Gly Lys Val Pro He Gly He Gly Glu Asn Thr Lys He 
405 410 415 

AGG AAA TGT ATC ATT GAC AAG AAC GCA AAG ATA GGA AAG AAT GTT TCA 1296 
Arg Lys Cys He He Asp Lys Asn Ala Lys He Gly Lys Asn Val Ser 
420 425 430 

ATC ATA AAT AAA GAC GGT GTT CAA GAG GCA GAC CGA CCA GAG GAA GGA 134 4 

He He Asn Lys Asp Gly Val Gin Glu Ala Asp Arg Pro Glu Glu Gly 
435 440 445 

TTC TAC ATA CGA TCA GGG ATA ATC ATT ATA TTA GAG AAA GCC ACA ATT 1392 
Phe Tyr He Arg Ser Gly lie He He He Leu Glu Lys Ala Thr He 
450 455 460 

AG A GAT GGA ACA GTC ATC TGAACTAGGG AAGCACCTCT TGTTGAACTA 14 40 

Arg Asp Gly Thr Val He 
465 470 
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CTGGAGATCC AAATCTCAAC TTGAAGAAGG TCAAGGGTGA TCCTAGCAC - 

GACTCCCCGA AGGAAGCTT 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS : 

{A) LENGTH: 470 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Asn Lys lie Lys Pro Gly Val Ala Tyr Ser Val He Thr Thr Glu Asn 
1 5 10 15 

Asp Thr Gin Thr Val Phe Val Asp Met Pro Arg Leu Glu Arg Arg Arg 
20 25 30 

Ala Asn Pro Lys Asp Val Ala Ala Val He Leu Gly Gly Gly Glu Gly 
35 40 45 

Thr Lys Leu Phe Pro Leu Thr Ser Arg Thr Ala Thr Pro Ala Val Pro 
50 55 60 

Val Gly Gly Cys Tyr Arg Leu He Asp He Pro Met Ser Asn Cys He 
65 70 75 60 

Asn Ser Ala He Asn Lys He Phe Val Leu Thr Gin Tyr Asn Ser Ala 
85 90 95 

Pro Leu Asn Arg His He Ala Arg Thr Tyr Phe Gly Asn Gly Val Ser 
100 105 110 

Phe Gly Asp Gly Phe Val Glu Val Leu Ala Ala Thr Gin Thr Pro Gly 
115 120 125 

Glu Ala Gly Lys Lys Trp Phe Gin Gly Thr Ala Asp Ala Val Arg Lys 
130 135 140 

Phe He Trp Val Phe Glu Asp Ala Lys Asn Lys Asn He Glu Asn He 

150 155 160 

Val Val Leu Ser Gly Asp His Leu Tyr Arg Met Asp Tyr Met Glu Leu 
165 170 175 

Val Gin Asn His lie Asp Arg Asn Ala Asp He Thr Leu Ser Cys Ala 
180 185 . 190 

Pro Ala Glu Asp Ser Arg Ala Ser Asp Phe Gly Leu Val Lys He Asp 
195 200 205 

Ser Arg Gly Arg Val Val Gin Phe Ala Glu Lys Pro Lys Gly Phe Asp 
210 215 220 

Leu Lys Ala Met Gin Val Asp Thr Thr Leu Val Gly Leu Ser Pro Gin 
225 230 235 240 

Asp Ala Lys Lys Ser Pro Tyr He Ala Ser Met Gly Val Tyr Val Phe 
245 250 255 
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LV3 Thr Asp Val Leu Leu Lys Leu Leu Lys Trp Ser Tyr Pro 
y 260 265 270 

Asn Asp Phe Gly Ser Glu lie lie Pro Ala Ala He Asp Asp Tyr Asn 
275 280 285 

Val Gin Ala Tyr lie Phe Lys Asp Tyr Trp Glu Asp He Gly Thr lie 
290 295 300 

Lvs Ser Phe Tyr Asn Ala Ser Leu Ala Leu Thr Gin Glu Phe Pro Glu 
305 310 315 320 

Phe Gin Phe Tyr Asp Pro Lys Thr Pro Phe Tyr Thr Ser Pro Arg Phe 
325 330 335 

Leu Pro Pro Thr Ly3 He Asp Asn Cys Lys He Lys Asp Ala He He 
340 345 350 

Ser His Gly Cys Phe Leu Arg Asp Cys Ser Val Glu His Ser lie Val 
355 360 365 

Gly Glu Arg Ser Arg Leu Asp Cys Gly Val Glu Leu Lys Asp Thr Phe 
; 370 375 380 

| 

I Met Met Gly Ala Asp Tyr Tyr Gin Thr Glu Ser Glu He Ala Ser Leu 
; 385 390 395 400 

Leu Ala Glu Gly Lys Val Pro He Gly He Gly Glu Asn Thr Lys He 
405 410 415 

: Arq Lys Cys He He Asp Lys Asn Ala Lys He Gly Lys Asn Val Ser 
420 425 430 

; lie He Asn Lys Asp Gly Val Gin Glu Ala Asp Arg Pro Glu Glu Gly 
| 435 440 445 

! Phe Tyr He Arg Ser Gly He He He He Leu Glu Lys Ala Thr He 
j 450 455 460 

! Arg Asp Gly Thr Val He 
465 470 

(2) INFORMATION FOR SEQ ID NO:ll: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 35 base pairs 
(5) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:ll: 
GTTGATAACA AGATCTGTTA ACCATGGCGG CTTCC 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CCAG77AAAA CGGAGCTCAT CAGATGATGA TTC 33 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(iii MOLECULE TYPE: DNA (synthetic) 



(xii SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GTGTGAGAAC ATAAATCTTG GATATGTTAC 30 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 4 : 



GAATTCACAG GGCCATGGCT CTAGACCC 28 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 
AAGATCAAAC CTGCCATGGC TTACTCTGTG ATCACTACTG 40 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs _ 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
<ii) MOLECULE TYPE: DNA (synthetic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GGGAATTCAA GCTTGGATCC CGGGCCCCCC CCCCCCCCC 39 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GGGAATTCAA GCTTGGATCC CGGG 2 4 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CCTCTAGACA GTCGATCAGG AGCAGATGTA CG 32 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GGAGTTAGCC ATGGTTAGTT TAGAG 25 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs _ 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
GGCCGAGCTC GTCAACGCCG TCTGCGATTT GTGC 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE.: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GATTTAGGTG ACACTATAG 
(2) INFORMATION FOR SEQ ID NO:22: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
AGAGAGATCT AGAACAATGG CTTCCTCTAT GCTCTCTTCC GC 42 
(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
GGCCGAGCTC TAGATTATCG CTCCTGTTTA TGCCCTAAC 



39 
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10 



15 



20 



25 



30 



Claims 

1. A method for increasing the starch content of a 
plant which comprises altering said plant to increase the 
ADPglucose pyrophosphorylase activity in said plant 

2. A method of producing genetically transformed 
plants which have elevated starch content, comprising the steps 
of: 

(a) inserting into the genome of a plant cell a 
recombinant, double-stranded DNA 
molecule comprising 

(i) a promoter which functions in 
plants to cause the production of an 
RNA sequence in the target plant 
tissues, 

(ii) a structural DNA sequence that 
causes the production of an RNA 
sequence which encodes a fusion 
polypeptide comprising an amino- 
terminal plastid transit peptide and 
an ADPglucose pyrophosphorylase 
enzyme, 

(Hi) a 3* non-translated DNA sequence 
which functions in plant cells to 
cause transcriptional termination 
and the addition of polyadenylated 
nucleotides to the 3* end of the RNA 
sequence; 

(b) obtaining transformed plant cells; and 
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(c) regenerating from the transformed plant 
cells genetically transformed plants which have an 
elevated starch content. 

5 3, A method of claim 2 in which the ADPglucose 

pyrophosphorylase enzyme is deregulated. 

4. A method of claim 3 in which the ADPglucose 
pyrophosphorylase enzyme is from bacteria. 

10 

5. A method of claim 3 in which the ADPglucose 
pyrophosphorylase enzyme is from plants or algae. 

6. A recombinant, double-stranded DNA molecule 
15 comprising in sequence: 

(a) a promoter which functions in plants to 
cause the production of an RNA sequence 
in the target plant tissues; 

(b) a structural DNA sequence that causes the 
20 production of an RNA sequence which 

encodes a fusion polypeptide comprising an 
ammo-terminal plastid transit peptide and 
an ADPglucose pyrophosphorylase 
enzyme; and 

25 (c) a 3' non-translated region which functions 

in plant cells to cause transcriptional 
termination and the addition of 
polyadenylated nucleotides to the 3' end of 
the RNA sequence, 

30 said promoter is heterologous with respect to said structural 
DNA. 
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7 A DNA molecule of claim 6 in which the 
ADPglucose pyrophosphorylase enzyme is deregulated. 

8. A DNA molecule of claim 6 in which the plastid 
transit peptide is heterologous to the source of the ADPglucose 
pyrophosphorylase structural DNA. 

9. A DNA molecule of claim 8 in which the 
ADPglucose pyrophosphorylase is from bacteria. 

10. A plant cell comprising a recombinant, double- 
stranded DNA molecule comprising in sequence: 

(a) a promoter which functions in plants to 
cause the production of an RNA sequence 
in target plant tissues; 

(b) a structural DNA sequence that causes the 
production of an RNA sequence which 
encodes a fusion polypeptide comprising an 
amino-terminal plastid transit peptide and 
an ADPglucose pyrophosphorylase 
enzyme; and 

(c) a 3* non-translated region which functions 
in plant cells to cause transcriptional 
termination and the addition of 
polyadenylated nucleotides to the 3* end of 
the RNA sequence 

in which the DNA molecule is foreign to said plant cell. 

11. A plant cell of claim 11 in which the ADPglucose 
pyrophosphorylase enzyme is deregulated. 
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12. A plant cell of claim 11 in which the promoter is 
heterologous with respect to the ADPglucose pyrophosphorylase 
structural DNA. 

5 13. A plant cell of claim 12 in which the plastid 

transit peptide is heterologous to the source of the ADPglucose 
pyrophosphorylase structural DNA. 

14. A plant cell of claim 13 in which the ADPglucose 
10 pyrophosphorylase is from bacteria. 

15. A plant cell of claim 10 selected from the group 
consisting of corn, wheat, rice, carrot, onion, pea, tomato, potato 
and sweet potato, peanut, canola/oilseed rape, barley, sorghum, 

15 cassava, banana, soybeans, lettuce, apple and walnut. 

16. A plant consisting of plant cells of claim 10. 

17. A plant of claim 16 in which the ADPglucose 
20 pyrophosphorylase enzyme is deregulated. 

18. A plant of claim 17 in which the promoter is 
heterologous to the source of the ADPglucose pyrophosphorylase 
structural DNA. 

25 

19. A plant of claim 17 in which the plastid transit 
peptide is heterologous to the source of the ADPglucose 
pyrophosphorylase structural DNA. 



30 20. A plant of claim 18 in which the ADPglucose 

pyrophosphorylase is from bacteria. 
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21. A potato plant cell of claim 10. 

22. A potato plant cell of claim 14. 
5 23. A potato plant of claim 18. 

24. A potato plant of claim 20. 

25. A potato plant of claim 22 which is vaf. Russet- 
10 Burbank. 

26. A potato plant of claim 24 which is var. Russet- 
Burbank. 

15 27. A method of claim 1 in which said plant is potato. 

28. A method of claim 2 in which said plant is potato. 

29. A method of claim 3 in which said plant is potato. 

20 

30. A method of claim 4 in which said plant is potato. 

31. A tomato plant cell of claim 10. 
25 32. A tomato plant cell of claim 14. 

33. A tomato plant of claim 16. 

34. A tomato plant of claim 20. 

35. A method of claim 1 in which said plant is 

tomato. 



30 



WO 91/19806 PCT/US9 1/04036 



36. A method of claim 2 in which said plant is 

tomato. 

37. A method of claim 3 in which said plant is 

5 tomato. 

38. A method of claim 4 in which said plant is 

tomato. 
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DNA ATGGTTAGTTTAGAGAAGAACGATCACTTAATGTTGGCGCGCCAGCTGCCATTGAAATCT 

t 1 1 1 1 1 "! 

Protein MVSLEKNDHLMLARQLPIKS 

GTTGCCCTGATACTGGCGGGAGGACGTGGTACCCGCCTGAAGGATTTAACCAATAAGCGA 

61 , 1 1 1 1 1 

VALILAGGRGTRLKDLTNKR 

GCAAAACCGGCCGTACACTTCGGCGGTAAGTTCCGCATTATCGACTTTGCGCTGTCTAAC 

121 1 1 1 1 1 1 

AKPAVHFGGKFRIIDFALSN 

TGCATCAACTCCGGGATCCGTCGTATGGGCGTGATCACCCAGTACCAGTCCCACACTCTG 

l81 1 1 1 1 1 

C1NSG1RRMGV1TQYQSHTL 

GTGCAGCACATTCAGCGCGGCTGGTCATTCTTCAATGAAGAAATGAACGAGTTTGTCGAT 

241 1 1 1 1 1 1 

VQHIQRGVSFFNEEMNEFVD 

CTGCTGCCAGCACAGCAGAGAATGAAAGGGGAAAACTGGTATCGCGGCACCGCAGATGCG 

301 L L P A Q Q R M K G E N V Y R G T A D A 

GTCACCCAAAACCTCGACATTATCCGTCGTTATAAAGCGGAATACGTGGTGATCCTGGCG 

361 1 1 1 1 1 1 

VTQNLD1 IRRYKAEYVV1LA 

GGCGACCATATCTACAAGCAAGACTACTCGCGTATGCTTATCGATCACGTCGAAAAAGGT 
421 GDHIYKQDYSRML1DHVEKG 

GTACGTTGTACCGTTGTTTGTATGCCAGTACCGATTGMGAAGCCTCCGCATTTGGCGTT 
481 VRCT VVCHPVPIEEASAFGV 



ATGGCGGTTGATGAGAACGATAAAACTATCGAATTCGTGGAAAAACCTGCTAACCCGCCG 

541 1 1 1 1 1 1" 1 

MAVDENDKTIEFVEKPANPP 

TCAATGCCGMCGATCCGAGCAAATCTCTGGCGAGTATGGGTATCTACGTCTTTGACG.ee 
601 "(TP NDPSKSLASMGIYVFDA 

GAeTATCTGTATGAACTGCTGGAAGAAGACGATCGCGATGAGAACTCCAGCCACGACTTT 

66l | 1 1 ^1 1 1 

DYLYELLEEDDRDENSSHDF 

FIG.1A 
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. G6CAAAGATTTGATTCCCAAGATCACCGAAGCCGGTCTGGCCTATGCGCACCCGTTCCCG 
72i , , , , , ( 

GKDL I P K I TEAGLAYAHPFP 

CTCTCTTGCGTACAATCCGACCCGGATGCCGAGCCGTACTGGCGCGATGTGGGTACGCTG 
781 1 , , , ( j 

LSCVQSDPDAEP YURDVGTL 
GAAGCTTACTGGAAAGCGAACCTCGATCTGGCCTCTGTGGTGCCGAAACTGGATATGTAC 



EAYVKANLDLASVVPKLDMY 

GATCGCAATTGGCCAATTCGCACCTACAATGAATCATTACCGCCAGCGAAATTCGTGCAG 
901 1 , 1 , ( 

DRNWPIRTYNESLPPAKFVQ 

GATCGCTCCGGTAGCCACGGGATGACCCTTAACTCACTGGTTTCCGGCGGTTGTGTGATC 
%l , , , , j ( 

DRSGSHGMTLNSLVSGGCVI 

TCCGGTTCGGTGGTGGTGCAGTCCGTTCTGTTCTCGCGCGTTCGCGTGAATTCATTCTGC 
l02l , , , , ( ( 

SGSVVVQSVLFSRVRVNSFC 

AACATTGATTCCGCCGTATTGTTACCGGAAGTATGGGTAGGTCGCTCGTGCCGTCTGCGC 
1081 1 -| 1 , , , 

N.IDSAVLLPEVVVGRSCRLR 
CGCTGCGTCATCGATCGTGCTTGTGTTATTCCGGAAGGCATGGTGATTGGTGAAAACGCA 



RC VIDRACVIPE GMVIGENA 

^ gaggaagatgcacgtcgtttctatcgttcagaagaaggcatcgtgctggtaacgcgcgaa 
eedarrfyrseegivlvtre' 
atgctacggaagttagggcataaacaggagcgataa 

1261 1 1 1 

MLRKLGHKQER* 
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DNA ATGGTTAGTTTAGAGAAGAACGATCAC7TAATGTTGGCGCGCCAGCTGCCAT7GAAATCT 

! , , 1 1 1 1 

Protein MVSLEKNDHLMLARQLPLKS 

G77GCCC7GA7AC7GGCGGGAGGACG7GG7ACCCGCC7GAAGGA777AACCAATAAGCGA 

6l 1 1 1 1 1 1 

YAL1LAGGRGTRLKDLTMKR 

GCAAAACCGGCCGTACACTTCGGCGGTAAGTTCCGCATTATCGACTTTGCGCTGTCTAAC 

l21 1 1 1- 1 1 1 

AKPAVHFGGKFRI IDFALSN 

TGCATCAACTCCGGGATCCGTCGTATGGGCGTGATCACCCAGTACCAGTCCCACACTCTG 

C 1 NSG 1 RRMGV I T OYOSHT L 

GTGCAGCACATTCAGCGCGGCTGGTCATTCTTCAATGAAGAAATGAACGAGTTTGTCGAT 

241 VQHIQRGVSFFNEEMNEFVD 

CTGCTGCCAGCACAGCAGAGAATGAAAGGGGAAAACTGGTATCGCGGCACCGCAGATGCG 

30l 1 1 — -! 1 1 1 

LLPA. QQRMKGENUYRGTADA 

G7CACCCAMACCTCGACATTATCCGTCGTTATAAAGCGGAATACGTGGTGATCCTGGCG 

361 , 1 1 1 1 1 

V7QNLD I IRRYKAEYVV1LA 

GGCGACCATATCTACAAGCAAGACTACTCGCGTATGCTTATCGATCACGTCGAAAAAGGT 

42l 1 1 1 1 1 ~l 

GDHIYKODYSRHLI DHVEKG 

GTACGTTGTACCGTTGTTTGTATGCCAGTACCGATTGAAGAAGCCTCCGCATTTGGCGTT 
481 VRCTVVCMPVP1EEASAFGV 

ATGGCGGTTGATGAGAACGATAAAACTATCGAATTCGTGGAAAAACCTGCTAACCCGCCG 
541 MAVDENDKT1EFVEKPANPP 

TCAATGCCGAACGATCCGAGCAAATCTCTGGCGAGTATGGGTATCTACGTCTTTGACGCC 



601 



., , 1 1 

SMPNDPSKSIASMG1YVFDA 



GACTATCTGTATGAACTGCTGGAAGAAGACGATCGCGATGAGAACTCCAGCCACGACTTT 

661 1 ! 1 1 1 1 

DYLYELLEEDDRDENSSHDF 

GGCAAAGATTTGATTCCCAAGATCACCGAAGCCGGTCTGGCCTATGCGCACCCGTTCCCG 
721 G K D L 1 P K I T E A G L A Y A H P F P ^ ^ 
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CTCTCTTGCGTACAATCCGACCCGGATGCCGAGCCGTACTGGCGCGATGTGGGTACGCTG 
78l , , , _ ( ( 

LSCVOSDPDAEPYVRDVGTL 

GAAGCTTACTGGAAAGCGAACCTCGATCTGGCCTCTGTGGTGCCGGAACTGGATATGTAC 
841 1 1 , 1 i 

EAYVKANLDLAS VVPELDMY 

GATCGCAATTGGCCAATTCGCACCTACAATGAATCATTACCGCCAGCGAAATTCGTGCAG 
901 , , , 1 

D R N V P I R T Y N E S L P P A K F V Q 

. GATCGCTCCGGTAGCCACGGGATGACCCTTAA'CTCACTGGTTTCCGACGGTTGTGTGATC 
961 , , , ( ( 

DRSGSHGMTLNSLVSDGCVI 
j 02J ^ GTTCGGTGGT ^ T GCAGTCCGTTCTGTTCTCGCGCGTTCGCGTGAATTCATTCTGC 

sgsvvvqsvlfsrvrvn's TT' 
aacattgattccgccgtattgttaccggaagtatgggtaggtcgctcgtgccgtctgcgc 

1081 1 1 1 1- , , 

N I D S A V L L P E V V V G R S C R L R 

1141 ^^^ TCG ^ G ^^T GTGTTATTCCGG AAGGCATGGTGAnGGTGAAAACGCA 
RCVIDRACVIPEGMVI g'TTT' 

gaggaagatgcacgtcgtttctatcgttcagaagaaggcatcgtgctggtaacgcgcgaa 

1201 1 , ! , ! 

EEDARRFYRSEEGIVLVTRE 

ATGCTACGGAAGTTAGGGCATAAACAGGAGCGATAA 
1261 1 , , 

MLRKLGHKQER* 
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H 



ioBcttgttctcattgtt^^^ 60 

{ 1 1— i 

cagtcacacaaaga^^ l2Q 

6l 1 1- - " I 

M A S S M L S S A T M 

gttgcctctccggctca^^ 180 

12l 1 1 - - 

V A S P A 0 A T H V A P F N G L K S S A 

gccttcccagccac*^^ 240 

181 1 1 

, F P A T R K A N N D 1 T S I T S N G G 

agagttaurtgca^ 30Q 

24 1 1 1 

RVNCMQVWPPI 6KKKFETLS 

N 
c 

0 

1 



taccttcrtgacc^accgattccggtggtcgcg^gcat^^ 



355 



301 



Y 



L P D L T D S G G R V N C M Q A M 
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60 



1 CCATGGCGGCTTCCATTGGAGCCTTAAAATCTTCACCTTCTTCTAACAATTGCATCAATG 
MetAlaAlaSerlleGlyAlaLeuLysSerSerProSerSerAsnAsnCysI leAsnG 



61 



121 



181 



120 



180 



240 



AGAGAAGAAATGATTCTACACGTGCTGTATCCAGCAGAAATCTCTCATTTTCGTCTTCTC 
luAroArqAsnAspSerThrArgAlaValSerSerArgAsnLeuSerPheSerSerSerH 
ATCTCGCCGGAGACAAGTTGATGCCTGTATCGTCCTTACGTTCCCAAGGAGTCCGATTCA 
isLeuAlaGlyAspLysLeuMetProValSerSerLeuArgSerGlnGlyValArgPheA 
i o i ATGTGAGAAGAAGTCCAATGATTGTGTCGCCAAAGGCTGTTTCTGATTCGCAGAATTCAC 
snValAroAroSerProMetlleValSerProLysAlaValSerAspSerGlnAsnSerG 
241 AGACATGTCTAGACCCAGATGCTAGCCGGAGTGTTTTGGGAATTATTCTTGGAGGTGGAG 300 

InThrCysLeuAspProAspAlaSerArgSerValLeuGlyllelleLeuGlyGlyGtyA 
301 CTGGGACCCGACTTTATCCTCTAACTAAAAAAAGAGCAAAGCCAGCTGTTCCACTTGGAG 360 

taGlyThrArgLeuTyrProLeuThrLysLysArgAlaLysProAlaValProLeuGlyA 
361 CAAATTATCGTCTGATTGACATTCCTGTAAGCAACTGCTTGAACAGTAATATATCCAAGA 420 

laAsnTyrArqLeuIleAspIleProValSerAsnCysLeuAsnSerAsnlleSerLysI 
421 TTTATGTTCTCACACAATTCAACTCTGCCTCTCTGAATCGCCACCTTTCACGAGCATATG 480 

leTvrValLeuThrGlnPheAsnSerAlaSerLeuAsnArgHisLeuSerArgAUTyrA 
481 CTAGCAACATGGGAGGATACAAAAACGAGGGCTTTGTGGAAGTTCTTGCTGCTCAACAAA 540 

i a ser AsnMetG I yG I yTyrLysAsnG I uG I yPheVa I G luVa ILeuA I aA I aG InG I n S 
541 GTCCAGAGAACCCCGATTGGTTCCAGGGCACGGCTGATGCTGTCAGACAATATCTGTGGT 600 

e rProGluAsnProAspTrpPheGlnGlyThrAlaAspAlaValArgGlnTyrLeuTrpL 
601 TGTTTGAGGAGCATACTGTTCTTGAATACCTTATACTTGCTGGAGATCATCTGTATCGAA 660 

euPheG luG luH i sThrVa ILeuG luTyrLeul \ eLe^loGlyte^lsLei^^QM 
661 TGGATTATGAAAAGTTTATTCAAGCCCACAGAGAAACAGATGCTGATATTACCGTTGCCG 720 

p-tAsDTvrGluLvsPhel leGlnAlaHisArgGluThrAspAlaAspl leThrValAlaA ^ 
721 CACTGCCAATGGACGAGAAGCGTGCCACTGCATTCGGTCTCATGAAGATTGACGAAGAAG 780 

laLeuProMetAspGluLysArgAlaThrAlaPheGlyLeuMetLyslleAspGluGluG 
781 GACGCATTATTGAATTTGCAGAGAAACCGCAAGGAGAGCAATTGCAAGCAATGAAAGTGG 840 

lyArgllelleGluPheAlaGluLysProGlnGlyGluGlnLeuGtnAlaHeiLysYalA 
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841 ATACTACCATTTTAGGTCTTGATGACAAGAGAGCTAAAGAAATGCCTTTCATTGCCAGTA 900 
spThrThrl leLeuGlyLeuAspAspLysArgAlaLysGluMetProPhel leAlaSerM 

901 TGGGTATATATGTCATTAGCAAAGACGTGATGTTAAACCTACTTCGTGACAAGTTCCCTG 960 
etGlyl leTyrVall leSerLysAspValHetLeuAsnLeuLeuArgAspLvsPheProG 

961 GGGCCAATGATTTTGGTAGTGAAGTTATTCCTGGTGCAACTTCACTTGGGATGAGAGTGC 1 020 
I yA I aAsn AspPheG I ySerG I uVa III eProG I yA UThrSerLeuG I yMet Arc Va I G 

1 021 AAGCTTATTTATATGATGGGTACTGGGAAGATATTGGTACCATTGAAGCTTTCTACAATG 1 080 
I n A I a TyrLeuTyr AspG I yTyr TrpG I uAsp I leGl yThr I leGluAl aPheTyr Asn A 

1 081 CCAATTTGGGCATTACAAAAAAGCCGGTGCCAGATTTTAGCTTTTACGACCGATCAGCCC 1140 

laAsnLeuGlylleThrLysLysProVolProAspPheSerPheTyrAspArqSerAlaP 
1141 CAATCTACACCCAACCTCGATATCTACCACCATCAAAAATGCTTGATGCTGATGTCACAG 1200 

roIleTyrThrGlnProArgTyrLeuProProSerLysMe-tLeuAspAlaAspValThrA 
1201 ATAGTGTCATTGGTGAAGGTTGTGTGATCAAGAACTGTAAGATTCATCATTCCGTGGTTG 1260 

spSer Va III eG I yG luC lyCysVa II I eLysAsnCysLys I leHisHi sSerVa IValG 
1261 GACTCAGATCATGCATATCAGAGGGAGCAATTATAGAAGACTCACTTTTGATGGGGG'CAG 1320 

I yLeuArgSerCys I leSerGluGlyAlal lei leGl uAspSerLeuLeuMe-tG lyAlaA 
1321 ATTACTATGAGACTGATGCTGACAGGAAGTTGCTGGCTGCAAAGGGCAGTGTCCCAATTG 1380 

spTyrTyrG I uThrAspA I aAspArgLysLeuLeuA I qA I aLysG I ySer Va I Pro II eG 
1381 GCATCGGCAAGAATTGTCACATTAAAAGAGCCATTATCGACAAGAATGCCCGTATAGGGG 1440 

lyl leGlyLysAsnCysHlsI leLysArgAlal lei leAspLysAsnAUArgl leGlyA 
1441 ACAATGTGAAGATCATTAACAAAGACAACGTTCAAGAAGCGGCTAGGGAAACAGATGGAT 1500 

spAsn ValLysIlell e AsnLysAspAsn ValGlnGluAlaAl aArgG I uThr AspG I y T 
1501 ACTTCATCAAGAGTGGGATTGTCACCGTCATCAAGGATGCnTGATTCCAAGTGGAATCA 1560 

yrPhe I leLysSerG lylleVol ThrVa 1 1 1 eLysAspA I aLeu 1 1 eProSer G lyl lei 
1561 TCATCTGATGAGCTC 1575 . Y 

lelleEndEnd 
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1 AACAAGATCAAACCTGGGGTTGCTTACTCTGTGATCACTACTGAAAATGACACACAGACT 60 

AsnLvsIleLysProGlyValAlaTyrSerVallteThrThrGluAsnAspThrGlnThr 
61 GTGTTCGTAGATATGCCACGTCTTGAGAGACGCCGGGCAAATCCAAAGGATGTGGCTGCA 120 

ValPheValAspMetProArgLeuGluArgArgArgAlaAsnProLysAspValAlaAla 
121 GTCATACTGGGAGGAGGAGAAGGGACCAAGTTATTCCCACTTACAAGTAGAACTGCAACC 180 

VallleLeuGlyGlyGlyGluGlyThrLysLeuPheProLeuThrSerArgThrAlaThr 
181 CCTGCTGTTCCGGTTGGAGGATGCTACAGGCTAATAGACATCCCAATGAGCAACTGTATC 240 

ProAlaValProValGlyGlyCysTyrArgLeuIleAspIleProMetSerAsnCysIle 
241 AACAGTGCTATTAACAAGATTTTTGTGCTGACACAGTACAATTCTGCTCCCCTGAATCGT 300 

AsnSerAlal leAsnLys! lePheValLeuThrGlnTyrAsnSerAlaProLeuAsnArg 
301 CACATTGCTCGAACATATTTTGGCAATGGTGTGAGCTTTGGAGATGGATTTGTCGAGGTA 360 

HlsIleAlaArgThrTyrPheGlyAsnG^ 
361 CTAGCTGCAACTCAGACACCCGGGGAAGCAGGAAAAAAATGGTTTCAAGGAACAGCAGAT 420 

LeuAtaAlaThrGlnThrProGlyGluAlaG^ 
421 GCTGTTAGAAAATTTATATGGGTTTTTGAGGACGCTAAGAACAAGAATATTGAAAATATC 480 

A I aVa I ArqLy sPhe 1 1 eTrpVa I PheG I uAspA I aLy sAsnLysAsn 1 1 eG luAsn 1 e 
481 GTTGTACTATCTGGGGATCATCTTTATAGGATGGATTATATGGAGTTGGTGCAGAACCAT 540 

ValValLeuSertHyAspHlsLeuT^^ 
541 ATTGACAGGAATGCTGATATTACTCTTTCATGTGCACCAGCTGAGGACAGCCGAGCATCA 600 

1 leAspArgAsnAUAspl leThrLeuSerCysAUProAlaGluAspSerArgAlaSer 
601 GATTTTGGGCTGGTCAAGATTGACAGCAGAGGCAGAGTAGTCCAGTTTGCTGAAAAACCA 660 

AspPheGlyLeuValLysl leAspSerArgGlyArgValValGlnPheAUGluLysPro 
661 AAAGGTTTTGATCTTAAAGCAATGCAAGTAGATACTACTCTTGTTGGATTATCTCCACAA 720 

LysG lyPheAspLeuLysA laMetG In Va lAspThrThrLeuVa IG lyLeuSerProG In 
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721 GATGCGAAGAAATCCCCCTATATTGCTTCAATGGGAGTTTATGTATrCAAGACAGATGTA 780 

AspAlaLysLysSerProTyrI leAlaSerMetGlyValTyrValPheLvsThrA<;n\/n I 
781 TTGTTGAAGCTCTTGAAATGGAGCTATC^ 840 

o,, ^^r LysLeuLeuLysTr P SerT y rProThrSerA snAspPheGlySerGluIleIt P 

841 CA A TATTGACGATTACAATGTCCAAGCATACATTTTCAAAGACTAT 900 

ProAl ttAlaIleAspAspTyrAsnVQlGlnAlQTyrIl e PheLysAspTvrTrnGlu^n 
901 ATTGGAAC^ 

Phpn Ph i C ^TCCAAAAACACCTTTTTACACATCTCCTAGGTTCCTTCCACCAACC 1030 
P ^ G ! nPheT y rAs P Prol -y sThrp roPheTyrThrSerProArQPheLeuProProThr 
1 031 AAGATAGACAATTGCAAGATTAAGGATGCCATAATCTCTC^^ 1 080 

, noi "-ysl leAspAsnCysLysI leLysAspAlal (el leSerHisGlyCysPheLeuAraAso 

loei gtctgtggaacactccatagtgggtgaaagatcgcgcttagattg™ 

Cy^rVQlGluHlsSerlleValGlyGluArgSerArgLeuAspCysGlvValGluLeu 
1141 AAGGATAC TTTCATGATGGGAGCAGACTACTACC AAACAGAATCTGAGATTK^ 1200 

1201 TTAGCAGAGGGGAAAGTACCGATTGGAATTGGGGAAAATACAAAAATAAGGAAATGTATC 1260 

LeuAlaGluGlyLysValProIleGlylleGlyGluAsnThrLysIleAroLysCvsIle 

1261 ATTGACAAGAACGCAAAGATAGGAAAGAATGTTTCAATCATAAATAAAGACGGTGTTCAA 1320 

I ^ spL y sAsnAlaL y sIleGl y L ysAsnValSerIleIleAsnLysAspGlvValGln 
1321 GAGG A GACCGACCAGAGGAAGGAm 

G I uA I QAspAr gProG I uG I uG I y PheTyr I leArgSerGlyl lei lei lei leLeuGlu 

1381 AAAGCCACAATTAGAGATGGAACAGTCATCTGAACTAGGGAAGCACCTCTTGTTGAACTA 1440 
LysAlaThrl leArgAspGlyThrVall leEnd 
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